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Description 

FIELD OF THE INVENTION 

The invention is in the area of plant molecular 
biology and concerns plant genetic engineering by 
recombinant DNA technology. The identification 
and characterization of a segment of DNA from the 
upstream nontranscribed region of a plant ubiquitin 
gene are described. This segment is capable of 
initiating and driving the transcription of nearby 
plant expressible genes in recombinant DNA-con- 
tainlng tissue from both monocotyledonous and 
dicotyledonous plants. The described DNA seg- 
ment will enable the selective expression and regu- 
lation of desired structural genes in plant tissue. 

BACKGROUND OF THE INVENTION 

Ubiquitin is an 8.5 kDa protein found in 
eukaryotic cells in either the free, monomeric state 
or covalently joined to various cytoplasmic, nuclear 
or membrane proteins. This protein contains 76 
amino acid residues and its amino acid sequence 
is conserved to an unusually high extent. The se- 
quence of ubiquitin is identical between species as 
diverse as human, cow, Mediten-anean fruit fly. 
Xenopus and chicken (U. Bond and IVI. Schlesinger 
(1985) Mol. Cell. Biol. 5:949-956). Yeast and human 
ubiquitin differ by only three different amino acids 
(K. Ozkaynak et al. (1984) Nature 312: 663-666). 
while plant ubiquitin differs from that of yeast by 
two amino acids. Based on this two or three amino 
acid difference in sequence, there appear to be at 
least 3 types of ubiquitin - animal, plant and yeast. 

Ubiquitin is found in three major cellular com- 
partments - the cytoplasmic membrane, the cytopl- 
asm and the nucleus. This protein is required for 
ATP-dependent degradation of intracellular pro- 
teins, a non-lysosomal pathway to eliminate from 
the cell those proteins that are damaged or ab- 
normal as well as riormal proteins having a short 
half-life (A. Hershko et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:1619-1623; D. Finley et al. (1985) 
Trends Biol. Sci. 10:343-347). Ubiquitin binds to a 
target protein, tagging it for degradation. The cova- 
lent attachment is through isopeptide linkages be- 
tween the carboxylterminus (glycine) in ubiquitin 
and the €-amino group of lysyl side chains in the 
target proteins. 

Ubiquitin also plays a role in the cellular re- 
sponse to stresses, such as heat shock and In- 
crease in metal (arsenite) concentration (D. Finley 
et al. (1985) supra ). Most living cells respond to 
stress (for example, exposure to temperatures a 
few degrees above normal physiological tempera- 
tures, or to elevated concentrations of heavy met- 
als, ethanol. oxidants and amino acid analogs) by 



activating a small set of genes to selectively syn- 
thesize stress proteins, also called heat shock pro- 
teins. In most organisms these stress proteins were 
found to have subunit molecular weights of 89, 70 
5 and 23 kDa (U. Bond and M. Schlesinger (1985) 
supra ). Ubiquitin. with a molecular weight of ap- 
proximately 8.5 kDa, also responds to stress, since 
in different species (yeast, mouse, gerbil and 
chicken embryo fibroblasts) the levels of ubiquitin 
70 mRNA and ubiquitin protein increase as a result of 
different stress conditions. 

In eukaryotic systems the expression of genes 
is directed by a region of the DNA sequence called 
the promoter. In general, the promoter is consid- 
75 ered to be that portion of the DNA. upstream from 
the coding region, that contains the binding site for 
RNA polymerase II and initiates transcription of the 
DNA. The promoter region also comprises other 
elements that act as regulators of gene expression. 
20 These include a TATA box consensus sequence in 
the vicinity of about -30, and often a CAAT box 
consensus sequence at about -75 bp 5' relative to 
the transcription state site, or cap site, which is 
defined as +1 (R. Breathnach and P. Chambon 
25 (1981) Ann. Rev. Biochem. 50:349-383; J. Messing 
et al. (1983) in Genetic Engineering of Plants , eds. 
T. Kosuge, CP. Meredith and A. Hollaender, pp. 
211-227). In plants the CAAT box may be substi- 
tuted by the AGGA box (J. Messing et al. (1983) 
30 supra ). Other regulatory elements that may be 
present are those that affect gene expression in 
response to environmental stimuli, such as illu- 
mination or nutrient availability, or to adverse con- 
ditions, such as heat shock, anaeroblosis or the 
35 presence of heavy metal. In addition, there may be 
present DNA sequences which control gene ex- 
pression during development, or in a tissue-specific 
fashion. Other regulatory elements that have been 
found are the enhancers (in animal systems) or the 
40 upstream activating sequences (in yeast), that act 
to elevate the overall expression of nearby genes 
in a manner that is independent of position and 
orientation with respect to the nearby gene. Se- 
quences homologous to the animal enhancer core 
45 consensus sequence, 5'-GGTGTGGAAA(orTTT)G- 
3', have been described in plants, for example, in 
the pea legumin gene at about position -180 rela- 
tive to the transcription state site (G. Lycett et al. 
(1984) Nucleic Acids Res. 12:4493-4506) and in the 
50 maize Adh1 and Adh2 genes at about -200 and 
-170 bp, respectively, from the transcription state 
site. In general, promoters are found 5'. or up- 
stream, relative to the start of the coding region of 
the corresponding gene and this promoter region, 
55 comprising all the ancillary regulatory elements, 
may contain between 100 and 1000 or more 
nucleotides. 
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Of the regulatory elements controlling gene 
expression, the heat shock element is perhaps one 
of the most widely studied. Although the universal- 
ity of cellular response to heat shock has been 
known for almost a decade, very little is known yet 
about the function of the heat shock proteins selec- 
tively synthesized by the stressed cell. The induc- 
tion of stress protein synthesis occurs at a tran- 
scription level and the response has been found to 
be similar in bacteria, fungi, insects and mammals 
(E. Craig (1985) CTC Crit. Rev. Biochem. 18:239- 
280). In addition to the synthesis and accumulation 
of the classic heat shock proteins in response to 
stress, cells that are stressed also synthesize prot- 
eases and ubiquitin. In E. coli , a 94 kDa enzyme 
that has an ATP-depondent proteolytic activity is 
encoded by the Ion (cap R) gene whose expression 
is under control of the heat shock regulon (E. 
Ozkaynak et al. (1984) Nature 312:663-666). In 
chicken embryo fibroblasts (U. Bond et al. (1985) 
Mol. Cell. Biol. 5:949-956) the ubiquitin mRNA level 
increased five fold after heat shock or after expo- 
sure to 50 uM arsenite. Each mRNA comprises 
tandemly repeated identical polypeptides which, 
upon translation as a polyubiquitin molecule, gives 
rise to multiple ubiquitin molecules, offering a dis- 
tinctive mechanism for amplifying genetic informa- 
tion. This elevated level of ubiquitin mRNA does 
not persist during the recovery phase after heat 
shock, indicating a transient role for free ubiquitin 
during the stress response. 

It has been postulated (J. Anathan et al. (1986) 
Science 232:522-524) that metabolic stresses that 
trigger the activation of heat shock protein genes 
act through a common mechanism. The metabolic 
stresses that activate heat shock genes cause de- 
naturation of Intracellular proteins; the accumulation 
of abnormal proteins acts as a signal to activate 
heat shock genes. A role for ubiquitin in targeting 
abnormal proteins for degradation, as well as for 
different proteolytic enzymes, would be compatible 
with such a model of heat shock protein gene 
regulation. 

I^ost of the early work on heat shock genes 
was done with Drosophila species. In particular, the 
Drosophila hsp70 gene was used widely in recom- 
binant studies. In homologous systems, the 
Drosophila hsp70 gene was fused to the E. coli ^- 
galactosidase structural gene to allow the activity of 
the hybrid gene to be distinguished from the five 
resident hsp70 heat shock genes in the recipient 
Drosophila. Drosophila heat shock genes were also 
introduced into heterologous systems, e.g., in mon- 
key COS cells and mouse cells (H. Pelham (1982) 
Cell 30:517-528). Regulation by heat shock was 
observed In the hybrid hsp70-lac Z gene which 
was integrated into the Drosophila germ line and 
into which a 7 kb E.coli /3-galactosidase DNA frag- 



ment was Inserted into the middle of the hsp70 
structural gene. The resultant j8-galactosidase ac- 
tivity in the transformants was shown (J. Lis et al. 
(1983) Cell 35:403-410) to be regulated by heat 
5 shock. 

The DNA sequence conferring heat shock re- 
sponse was identified by deletion analysis of the 
Drosophila hsp70 heat shock promoter to be 5'- 
CTGGAAT_TTCTAGA-3' (H. Pelham et al. (1982) 
10 in Heat Shock From Bacteria to Man . Cold Spring 
Harbor Laboratory, pp. 43-48) and is generally lo- 
cated in the -66 through -47 region of the gene or 
approximately 26 bases upstream of the TATA box. 
It was further demonstrated that a chemically syn- 
15 thesized copy of this element, when placed up- 
stream of the TATA box of the herpes virus 
thymidine kinase gene in place of the normal up- 
stream promoter element, was sufficient to confer 
heat induclbility upon the thymidine kinase gene in 
20 monkey COS cells and in Xenopus oocytes. (The 
thymidine kinase gene is normally not heat induc- 
ible.) These heat shock sequences interact with 
heat shock specific transcription factor(s) which al- 
low the induction of heat shock proteins (C. Parker 
25 et al. (1984) Cell 37:273-283). Inducers of heat 
shock genes could be factors that alter (decrease) 
the concentration of heat shock proteins within the 
cell and, thus, control the transcription and transla- 
tion of heat shock genes. 
30 In higher plants, the stress response was dem- 

onstrated by increased protein synthesis in re- 
sponse to heat shock in soybean, pea, millet, corn, 
sunflower, cotton and wheat (T. Barnett et al. 
(1980) Dev. Genet. J[:331-340; J. Key et al. (1981) 
35 Proc. Natl. Acad. Sci. USA 78:3526-3530). The 
major differences in heat shock response seen 
among plant species are: (a) the amount of total 
protein synthesized in response to stress, (b) the 
size distribution of the different proteins synthe- 
40 sized, (c) the optimum temperature of induction of 
heat shock proteins and (d) the lethal (breakpoint) 
temperature. High molecular weight proteins are 
found to be electrophoretically similar among dif- 
ferent species. The low molecular weight (15-27 
45 kDa) heat shock proteins show more elec- 
trophoretic heterogeneity between species. In 
plants, the higher molecular weight proteins resem- 
ble those produced in Drosophila. There Is a 
marked difference, however, in the complexity of 
50 the low molecular weight heat shock proteins be- 
tween plants and Drosophila. Four heat shock pro- 
teins, 22, 34. 36 and 27 kDa, are synthesized in 
Drosophila, whereas soybean produces over 20 
heat shock proteins having molecular weights in 
55 the range of 15-18 kDa. The low molecular weight 
protein genes In soybeans are the most actively 
expressed and coordinately regulated genes under 
heat shock conditions (F. SchoffI et al. (1982) J. 
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Mol. Appl. Genet. 1^:301-314). 

Key et al. (EPO Application No. 85302593.0. 
filed April 12, 1985) have studied the promoter 
region of plant heat shock genes. Four soybean 
heat shock genes (three genes coding for 15-18 
kDa heat shock genes (three genes coding for IS- 
IS kDa heat shock proteins and one genes coding 
for a 17.3 kDa heat shock protein) were cloned and 
sequenced. The coding sequences and flanking 
sequences of the four heat shock genes were de- 
termined. The promoter regions of these four 
genes were subcloned. linked to a T-DNA shuttle 
vector and transferred into Agrobacterium 
tumefaciens . One of the recombinant clones of a 
soybean heat shock gene coding for a 15-18 kDa 
protein contained an open reading frame of 462 
nucleotides and a 291 nucleotide promoter region 
upstream of the ATG translation initiation codon. 
The promoter Included the TATA box. the CAAT 
box, the transcription initiation site and a heat 
shock consensus sequence 131-144 nucleotides 
upstream of the ATG translation start codon with 

the sequence 5'-CT_GAA ^TTC__AG-3'. Only 

three of the four clones showed substantial homol- 
ogy in the promoter region, but there were strong 
similarities between the heat shock consensus se- 
quences of all four clones. Significantly, the coding 
sequence, the upstream promoter region and the 
downstream flanking region of the four soybean 
heat shock genes had almost no resemblance to 
the corresponding regions of Drosophila heat shock 
genes. Although there were similarities between the 
consensus sequence of the promoter region from 
Drosophila and soybean heat shock genes, the 
promoter regions of soybean heat shock genes did 
not possess the inverted repeat sequences char- 
acteristic of Drosophila genes. 

The promoter region from the soybean heat 
shock genes was used to activate a soybean gene 
and a foreign gene (one normally not found in 
soybean) and to show regulation of the response 
by stress (Key et al.. EPO Application No. 
85302593.0, filed April 12. 1985). The promoter 
was isolated from the soybean SB 13 heat shock 
gene as a DNA fragment extending 65 bp down- 
stream from the start of transcription to include a 
major portion of the untranslated leader sequence 
but not the start codon for translation. A ^-galac- 
tosidase gene was placed under the control of the 
heat shock promoter within the T-DNA of the Ti- 
plasmid in a stable form within A. tumefaciens , and 
then was transfenred to a plant or plant cell culture. 
The actuality of DNA transfer was recognized by 
the expression of the )3-galactosidase gene as the 
production of a blue color after heat treatment in a 
medium containing the 5-bromo-4-chloro-3-indolyl- 
/8-D-galactosidase substrate molecule (M. Rose et 
al. (1981) Proc. Natl. Acad. Sci. USA 78:2460- 



2464). 

Experimentation with cross expression wherein 
a gene from one plant species is examined for 
expression in a different species adds a further 
6 dimension to the understanding of specific function. 
These experiments may embody the insertion of a 
gene under the control of its own promoter or of a 
gene artificially fused to a different or unnatural 
promoter. In 1983 Murai et al. (Science 222:476- 
fo 482) obtained expression of the phaseolin gene 
from Phaseolus vulgaris L. in sunflower (Helianthus) 
tissue under two sets of conditions: (!) when the 
Phaseolin gene was under the control of its own 
promoter and (ii) when the gene was spliced to. 
fs and under the control of a T-DNA promoter. In 
subsequent experiments It was shown that the 
phaseolin structural gene under the control of its 
natural promoter could be expressed in tobacco 
and that the tissue-specific expression in the het- 
20 erologous host (tobacco) was similar to that in the 
native host (bean) (C. Sengupta-Gopalen et al. 
(1985) Proc. Natl. Acad. Sci. USA 82:3320-3324). 

In later experiments (J. Jones et al. (1985) 
EMBO J. 4:2411-2418) the expression of the oc- 
25 topine synthetase gene (ocs) was described in both 
regenerated transformed homologous (petunia) and 
heterologous (tobacco) plants. In this study the ocs 
gene was fused to the promoter of a petunia chlo- 
rophyll a/b binding protein. Cross-expression was 
30 obtained by W. Gurley et al. (1986) Mol. Cell. Biol. 
6:559-565; and Key et al., EPO Application No. 
85302593.0. filed April 12, 1985, who reported 
strong transcription in sunflower tumor tissue of a 
soybean heat shock gene under control of its own 
36 promoter. In this case functional activity was mea- 
sured as the correct thermal induction response. 

The first evidence for transcription initiated 
from a monocotyledon promoter in a dicotyledon 
host plant was published by Matzke et ai. (1984) 
40 EMBO J. 3:1525-1531. These workers cloned the 
maize zein Z4 gene and introduced it on a Xs- 
derived vector into sunflower stemlets. The ensuing 
zein mRNA could then be translated in a wheat 
germ system but not In the transformed sunflower 
45 calli. 

In a later study the wheat gene whAB1.6 en- 
coding the major chlorophyll a/b binding protein 
was cloned Into a T-DNA-containIng vector and 
transferred to both petunia and tobacco (G. Lam- 

50 ppa et al. (1985) Nature 316:750-752). Expression 
was obtained in both the monocotyledon and di- 
cotyledon hosts and was determined to be light- 
induced and tissue-specific. In a more recent 
study. Rochester et al. (1986) EMBO J. 5:451-458) 

55 obtained expression of the maize heat shock hsp70 
gene in transgenic petunia. The maize hsp70 
mRNA was synthesized only in response to ther- 
mal stress. So far. these three studies constitute 
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the total number of published reports describing 
successful expression of monocot genes in trans- 
genic dicot plants. However, there are also nega- 
tive reports describing mininrial or no expression of 
maize alcohol dehydrogenase gone In tobacco 
hosts (Llewellyn et al. (1985) in Molecular Form 
and Function of the Plant Genome , L. van Vloten- 
Doting, G.S. Groot and T. Hall (eds.), Plenum Pub- 
lishing Corp., pp. 593-608; J.G. Ellis et al. (1987) 
EMBO J. 6:11-16), suggesting a possible inherent 
species-specific difference between monocot and 
dicot promoters. 

The heat shock response Is believed to provide 
thermal protection or thermotolerance to othenwise 
nonpermisslve temperatures (M. Schlesinger et al. 
(1982) in Heat Shock from Bacteria to Man , Cold 
Spring Harbor Laboratory, Cold Spring Harbor. 
New York. p. 329). A permissive heat shock tem- 
perature is a temperature which is high enough to 
induce the heat shock response but not high 
enough to be iethal. Thermotolerance in plant 
seedlings can be attained by different treatment 
regimes: (a) a 1 to 2 hour exposure to continuous 
heat shock at 40 • C followed by a 45 " C incubation, 
(b) a 30 minute heat shock at 40*C followed by 2 
to 3 hours at 28 • C prior to the shift to 45 • C. (c) a 
10 minute heat shock at 45"C followed by about 2 
hours at 28 • C prior to the shift to 45 ' C and (d) 
treatment of seedlings with 50 uM arsenite at 
28*0 tor 3 hours or more prior to the shift to 
45 'C. During the pretreatment prior to incubation 
at the potentially lethal temperature, heat shock 
proteins are synthesized and accumulated. Also, 
heat shock mRNA and protein syntheses occur at 
45 "C, if the plant seedling is preconditioned as 
described above. When the temperature is shifted 
back to physiological levels (e.g., 28*0), normal 
transcription and translation are resumed and after 
3 to 4 hours at normal temperature, there is no 
longer detectable synthesis of heat shock proteins 
(J. Key et al. (1981) Proc. Natl. Acad. Sci. USA 
78:3526-3530; M. Schlesinger et al. (1982) Trends 
Biochem. Sci. 2:222-225). The heat shock proteins 
that were synthesized during the 40 • C heat shock 
treatment are very stable and are not immediately 
degraded. 

Although ubiquitin is regulated in response to 
environmental stress, including heat shock, the reg- 
ulation of ubiquitin transcription differs from that of 
classical heat shock protein transcripts. Both 
ubiquitin and heat shock protein mRNA levels are 
elevated in response to cellular stress. However, 
whereas classical heat shock proteins accumulate 
during heat shock and persist during the recovery 
phase, ubiquitin mRNAs accumulated during heat 
shock are rapidly degraded within hours after 
stress treatment. This unstable mRNA transcript 
suggests a specialized but transient role for 



ubiquitin during heat shock, and implicates a 
unique DNA sequence in the ubiquitin gene pro- 
moter region, specifying specialized regulatory 
control during cellular response to stress. 

6 

SUMMARY OF THE INVENTION 

The primary object of this invention is to pro- 
vide novel DNA segments and constructions com- 

10 prising a regulatory promoter system which will 
enable those skilled in the art to selectively ex- 
press structural genes In plant tissue. The promoter 
comprises the DNA sequences from the 5' nontran- 
scribed regions of plant ubiquitin genes that initiate 

IS and regulate the transcription of genes placed un- 
der its control. In its preferred embodiment, the 
promoter sequence is derived from the upstream 
region of the ubiquitin gene from maize. 

The isolation and characterization of a pro- 

20 meter which is active in plants to control and 
regulate the expression of a downstream gene is 
described in the present work. This DNA sequence 
is found as a naturally occurring region upstream of 
the ubiquitin structural gene isolated from a maize 

25 genomic library. The transcription start site or cap 
site as determined by SI nuclease mapping is 
designated as base 1 and the sequences em- 
bodied within about 899 bases 5' of the transcrip- 
tion start site plus about 1093 bases 3' of the cap 

30 site but 5' of the translation start site constitute the 
ubiquitin promoter. Located within this approxi- 
mately 2kb promoter region are a TATA box (-30), 
two overlapping heat shock consensus elements (- 
204 and -214), an 83 nucleotide leader sequence 

35 Immediately adjacent to the transcription start site 
and an intron extending from base 84 to base 
1093. 

A further object of this Invention is to provide a 
recombinant DNA molecule comprising a plant ex- 

40 pressible promoter and a plant expressible struc- 
tural gene, wherein the structural gene is placed 
under the regulatory control of all transcription ini- 
tiating and activating elements of the promoter. In 
particular, the plant ubiquitin promoter can be com- 

45 bined with a variety of DNA sequences, typically 
structural genes, to provide DNA constructions for 
regulated transcription and translation of said DNA 
sequences and which will allow for regulated con- 
trol of expression when stressed with elevated tem- 

50 peratures. 

Such recombinant DNA molecules are intro- 
duced into plant tissue so that the promot- 
er/structural gene combination is expressed. It is 
contemplated that the method of the present inven- 

55 tlon is generally applicable to the expression of 
structural genes In both monocotyledonous and 
dicotyledonous plants. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is an analysis of a maize ubiquitin 
genomic clone. (A) Restriction map of ubiquitin 
gene. 7,2b1. (B) Restriction map of two subcloned 
PstI fragments of ubiquitin gene 1. (C) Scliematic 
representation of maize ubiquitin gene 1 organiza- 
tion. The 5' untranslated exon is indicated by the 
open box and the tandem ubiquitin coding regions 
are indicated by the numbered boxes. 

Figures 2-1 to 2-7 document the DNA se- 
quence and the deduced amino acid sequence of 
ubiquitin gene 1. The start of transcription as deter- 
mined by SI nuclease mapping is denoted as base 
1. Sequences representing the putative "TATA" 
box (-30) and the overlapping heat shock consen- 
sus sequences (-214 and -204) are underlined. The 
Intron extends from base 84 to base 1093 and the 
polyubiquitin protein coding sequence extends 
from base 1094 to 2693. 

Figure 3 demonstrates that all seven of the 
ubiquitin coding repeats encode an identical amino 
acid sequence. The nucleotide sequence of the 
seven repeats is shown aligned under the derived 
amino acid sequence. An additional 77th amino 
acid, glutamlne. is present in the 7th repeat pre- 
ceding the stop codon. A polyadenylatlon signal. 
AATAAT, is present in the 3* untranslated region, 
113bp from the stop codon. 

Figure 4 is a diagrammatic presentation of the 
procedure used for the construction of the maize 
ubiquitin promoter regionchloramphenicol acetyl 
transferase (CAT) gene fusion. 

Figure 5 presents an assay for the ubiquitin 
promoter. CaMV-CAT. cauliflower mosaic virus 358 
promoter - CAT gene fusion; UBQ-CAT, maize 
ubiquitin promoter - CAT gene fusion. 

DETAILED DESCRIPTION OF THE INVENTION 

The following definitions are provided In order 
to remove ambiguities as to the intent or scope of 
their usage in the specification and claims. 

Expression refers to the transcription and/or 
translation of a structural gene. 

Promoter refers to the nucleotide sequences at 
the 5' end of a structural gene which direct the 
initiation of transcription. Promoter sequences are 
necessary, but not always sufficient, to drive the 
expression of a downstream gene. In general, 
eukaryotic promoters include a characteristic DNA 
sequence homologous to the consensus 5'- 
TATAAT-3' (TATA) box about 10-30 bp 5' to the 
transcription start (cap) site, which, by convention, 
is numbered + 1 . Bases 3' to the cap site are given 
positive numbers, whereas bases 5' to the cap site 
receive negative numbers, reflecting their distance 
from the cap site. Another promoter component, 



the CAAT box, is often found about 30 to 70 bp 5' 
to the TATA box and has homology to the canoni- 
cal form 5'-CCAAT-3' (R. Breathnach and P. Cham- 
bon (1981) Ann. Rev. Biochem.. 50:349-383). In 

5 plants the CAAT box is sometimes replaced by a 
sequence known as the AGGA box, a region having 
adenine residues symmetrically flanking the triplet 
G(orT)NG (J. Messing et al. (1983), In Genetic 
Engineering of Plants . T. Kosuge et ai. (eds.), Ple- 

10 num Press, pp. 211-227). Other sequences confer- 
ring regulatory influences on transcription can be 
found within the promoter region and extending as 
far as 1000 bp or more from the cap site. 

Regulatory Control refers to the modulation of 

15 gene expression induced by DNA sequence ele- 
ments located primarily, but not exclusively, up- 
stream of (5' to) the transcription start site. Regula- 
tion may result in an all-or-nothing response to 
environmental stimuli, or it may result in variations 

20 in the level of gene expression. In this invention, 
the heat shock regulatory elements function to en- 
hance transiently the level of downstream gene 
expression In response to sudden temperature ele- 
vation. 

25 Placing a structural gene under the regulatory 

control of a promoter or a regulatory element 
means positioning the structural gene such that the 
expression of the gene is controlled by these se- 
quences. In general, promoters are found posi- 
30 tioned 5' (upstream) to the genes that they control. 
Thus, in the construction of heterologous promot- 
er/structural gene combinations, the promoter is 
preferably positioned upstream to the gene and at 
a distance from the transcription start site that 
35 approximates the distance between the promoter 
and the gene it controls in its natural setting. As is 
known In the art, some variation in this distance 
can be tolerated without loss of the promoter func- 
tion. Similarly, the preferred positioning of a regula- 
40 tory element with respect to a heterologous gene 
placed under is control reflects its natural position 
relative to the structural gene it naturally regulates. 
Again, as is known in the art, some variation in this 
distance can be accommodated. 
45 Promoter function during expression of a struc- 

tural gene under Its regulatory control can be test- 
ed at the transcriptional stage using DNA-RNA 
hybridization assays ("Northern" blots) and at the 
translational stage using specific functional assays 
50 for the protein synthesized (for example, by en- 
zymatic activity or by immunoassay of the protein). 

Structural gene is that portion of a gene com- 
prising DNA segment encoding a protein, polypep- 
tide or a portion thereof, and excluding the 5' 
55 sequence which drives the initiation of transcription. 
The structural gene may be one which Is normally 
found in the cell or one which is not normally found 
in the cellular location wherein it is introduced, in 
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which case It fs termed a heterologous gene . A 
heterologous gene may be derived in whole or in 
part from any source known to the art, including a 
bacterial genome or episome, eukaryotic. nuclear 
or plasmid DNA. cDNA, viral DNA or chemically 
synthesized DNA. A structural gene may contain 
one or more modifications in either the coding or 
the untranslated regions which could affect the 
biological activity or the chemical structure of the 
expression product, the rate of expression or the 
manner of expression control. Such modifications 
include, but are not limited to, mutations, insertions, 
deletions and substitutions of one or more 
nucleotides. The structural gene may constitute an 
uninterrupted coding sequence or it may include 
one or more introns. bound by the appropriate 
splice junctions. The structural gene may be a 
composite of segments derived from a plurality of 
sources, naturally occurring or synthetic. The struc- 
tural gene may also encode a fusion protein. It is 
contemplated that the introduction into plant tissue 
of recombinant DNA molecules containing the pro- 
moter/structural gene/polyadenylation signal com- 
plex will Include constructions wherein the struc- 
tural gene and its promoter are each derived from 
different plant species. 

Plant Ubiquitin Regulatory System refers to the 
approximately 1 kb nucleotide sequence 5' to the 
translation start site of the maize ubiquitin gene 
and comprises sequences that direct initiation of 
transcription, regulation of transcription, control of 
expression level, induction of stress genes and 
enhancement of expression in response to stress. 
The regulatory system, comprising both promoter 
and regulatory functions, is the DNA sequence 
providing regulatory control or modulation of gene 
expression. A structural gene placed under the 
regulatory control of the plant ubiquitin regulatory 
system means that a stnjctural gene is positioned 
such that the regulated expression of the gene is 
controlled by the sequences comprising the 
ubiquitin regulatory system. 

Polyadenylation signal refers to any nucleic 
acid sequence capable of effecting mRNA process- 
ing, usually characterized by the addition of 
polyadenylic acid tracts to the 3'-ends of the 
mRNA precursors. The polyadenylation signal DNA 
segment may itself be a composite of segments 
derived from several sources, naturally occurring or 
synthetic, and may be from a genomic DNA or an 
mRNA-derived cDNA. Polyadenylation signals are 
commonly recognized by the presence of homol- 
ogy to the canonical form 5'-AATAA-3', although 
variation of distance, partial "readthrough," and 
multiple tandem canonical sequences are not un- 
common (J. Messing et al. supra ). It should be 
recognized that a canonical "polyadenylation sig- 
nal" may in fact cause transcriptional termination 



and not polyadenylation ger se (C. Montell et al. 
(1983) Nature 305:600-605). 

Plant tissue includes differentiated and undif- 
ferentiated tissues of plants, including, but not limit- 
6 ed to roots, shoots, leaves, pollen, seeds, tumor 
tissue and various forms of cells in culture, such as 
single cells, protoplasts, embryos and callus tissue. 
The plant tissue may be in planta or in organ, 
tissue or cell culture. 
10 Homology , as used herein, refers to identity or 

near identity of nucleotide and/or amino acid se- 
quences. As is understood in the art, nucleotide 
mismatches can occur at the third or wobble base 
in the codon without causing amino acid substitu- 
fs tions in the final polypeptide sequence. Also, minor 
nucleotide modifications (e.g.. substitutions, inser- 
tions or deletions) in certain regions of the gene 
sequence can be tolerated and considered insig- 
nificant whenever such modifications result in 
20 changes in amino acid sequence that do not alter 
the functionality of the final product. It has been 
shown that chemically synthesized copies of whole, 
or parts of, gene sequences can replace the cor- 
responding regions in the natural gene without loss 
25 of gene function. Homologs of specific DNA se- 
quences may be identified by those skilled in the 
art using the test of cross-hybridization of nucleic 
acids under conditions of stringency as is well 
understood in the art (as described in Hames and 
30 Higgens (eds.) (1985) Nucleic Acid Hybridisation , 
IRL Press. Oxford. UK). Extent of homology is often 
measured in terms of percentage of identity be- 
tween the sequences compared. Thus, in this dis- 
closure it will be understood that minor sequence 
35 variation can exist within homologous sequences. 

Derived from is used herein to mean taken, 
obtained, received, traced, replicated or descended 
from a source (chemical and/or biological). A de- 
rivative may be produced by chemical or biological 
40 manipulation (including but not limited to substitu- 
tion, addition, insertion, deletion, extraction, isola- 
tion, mutation and replication) of the original 
source. 

Chemically synthesized , as related to a se- 
45 quence of DNA, means that the component 
nucleotides were assembled in vitro. Is/lanual 
chemical synthesis of DNA may be accomplished 
using well established procedures (M. Caruthers 
(1983) in Methodology of DNA and RNA Sequen- 
50 cing. Weissman (ed.), Praeger Publishers (New 
York) Chapter 1). or automated chemical synthesis 
can be performed using one of a number of com- 
mercially available machines. 

Heat shock elements refer to DNA sequences 
55 that regulate gene expression In response to the 
stress of sudden temperature elevations. The re- 
sponse is seen as an immediate albeit transitory 
enhancement in level of expression of a down- 
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stream gene. The original work on heat shock 
genes was done with Drosophila but many other 
species including plants (T. Barnett et ah (1980) 
Dev. Genet. 1^:331-340) exhibited analogous re- 
sponses to stress. The essential primary compo- 
nent of the heat shock elements was described in 
Drosophila to have the consensus sequence 5'- 
CTGGAAT-TTCTAGA-3' and to be located in the 
region between residues -68 through -47 bp up- 
stream to the transcriptional start site (H. Pelham 
and M. Bienz (1982) supra ). A chemically syn- 
thesized oligonucleotide copy of this consensus 
sequence can replace the natural sequence in con- 
ferring heat shock inducibility. In other systems, 
multiple heat shock elements were identified within 
the promoter region. For example, Rochester et al. 
(1986) supra recognized two heat shock elements 
in the maize hsp 70 gene. 

Leader sequence refers to a DNA sequence 
comprising about 100 nucleotides located between 
the transcription start site and the translation start 
site. Embodied within the leader sequence is a 
region that specifies the ribosome binding site. 

Introns or intervening sequences refer in this 
work to those regions of DNA sequence that are 
transcribed along with the coding sequences (ex- 
ons) but are then removed in the formation of the 
mature mRNA. Introns may occur anywhere within 
a transcribed sequence - between coding se- 
quences of the same or different genes, within the 
coding sequence of a gene, interrupting and split- 
ting Its amino acid sequences, and within the pro- 
moter region (5' to the translation start site). Introns 
In the primary transcript are excised and the cod- 
ing sequences are simultaneously and precisely 
ligated to fomn the mature mRNA. The junctions of 
Introns and exons form the splice sites. The base 
sequence of an intron begins with GU and ends 
with AG. The same splicing signal Is found in many 
higher eukaryotes. 

The present Invention relates to the develop- 
ment of a recombinant vector useful for the expres- 
sion of DNA coding segments in plant cells. The 
vector herein described employs a maize ubiquitin 
promoter to control expression of an inserted DNA 
coding segment. The transcriptional regulatory se- 
quences may be combined with an extrach- 
romosomal replication system for a predetermined 
host. Other DNA sequences having restriction sites 
for gene insertion may be added to provide a 
vector for the regulated transcription and translation 
of the Inserted genes in said host. The vector may 
also include a prokaryotic replication system allow- 
ing amplification in a prokaryotic host, markers for 
selection and other DNA regions. This would allow 
largo quantities of the vector to be grown In well 
characterized bacterial systems prior to transform- 
ing a plant or mammalian host. The principles for 



construction of a vector having proper orientation of 
the promoter and coding sequences with respect to 
each other are matters well-known to those skilled 
in the art. In some situations it may be desirable to 

5 join the promoter system to a desired structural 
gene and to introduce the resultant construct DNA 
directly into a host. Methods for such direct trans- 
fers include, but are not limited to, protoplast trans- 
formation, electroporation, direct injection of DNA 

10 into nuclei and co-transformatron by calcium pre- 
cipitation. 

This invention comprises the first report of an 
isolated and characterized plant ubiquitin promoter. 
The maize ubiquitin promoter as described in the 

75 present work includes the RNA polymerase rec- 
ognition and binding sites, the transcriptional Initi- 
ation sequence (cap site), regulatory sequences 
responsible for inducible transcription and an un- 
translatable intervening sequence (intron) between 

20 the transcriptional start site and the translational 
initiation site. Two overlapping heat shock consen- 
sus promoter sequences are situated 5' (-214 and 
-204) of the transcriptional start site. An exon of 83 
nucleotides is located immediately adjacent to the 

25 cap site and is followed by a large (approximately 
1 kb) Intron. 

The ubiquitin promoter along with the ubiquitin 
structural gene can be isolated on two approxi- 
mately 2kb Pst fragments of the maize genome 

30 (Figure 1). The entire fragment can be used to 
show promoter function by monitoring expression 
of mRNA or protein. Introduction of a heterologous 
gene downstream of the ubiquitin translation initi- 
ation codon will result in the expression of a fused 

35 protein. Insertion of a heterologous gene (having its 
own start and stop codons) between the ubiquitin 
promoter and translation initiation codon will result 
in the expression of the native polypeptide cor- 
responding to the inserted gene. The insertion of 

40 the desired structural gene is most conveniently 
accomplished with the use of blunt-ended linkers at 
the ends of the gene. 

Alternatively, the ubiquitin gene fragment may 
be restricted, particulariy at a site immediately pre- 

45 ceding the start of the structural gene or at a site 
preceding the transcription start site. For example, 
in the present invention the promoter fragment was 
derived from the ubiquitin gene as an approxi- 
mately 2kb ^1 fragment. To ensure that the pro- 

50 moter fragment is devoid of the translational Initi- 
ation codon, the fragment containing the 5' flanking 
region may be selectively digested with double 
stranded exonuclease under controlled conditions 
to remove a desired number of nucleotide pairs. It 

55 Is desirable to remove the ubiquitin translation initi- 
ation codon so that translation of the inserted gene 
will commence at its own start site. The isolated 
(and shortened) promoter fragment may then be 
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inserted Into the vector using linkers or homo- 
polymer tailing to Introduce desired restriction sites 
compatible with the remaining regions of the vec- 
tor. In general, the promoter fragment may be 
cleaved with specific restriction enzymes and the 
resultant shortened DNA fragments tested for pro- 
moter function and compared to that of the intact 
promoter. In addition. DNA codons may be added 
and/or existing sequences may be modified to give 
derivative DNA fragments retaining promoter func- 
tions. 

The resulting DNA constructs may be useful as 
cloning vehicles for a structural gene of interest in 
a plant host. In this invention, the structural gene 
encoding CAT under control of either the maize 
ubiquitin promoter or the cauliflower mosaic virus 
promoter was expressed in both oat and tobacco 
cells. When the ubiquitin promoter was employed, 
a greater degree of expression was obtained with 
the monocot host than with the dicot host; however, 
a higher level of expression was obtained with dicot 
than with monocot host when the cauliflower mo- 
saic virus promoter was utilized. The differential in 
expression levels reflects the inherent inequality of 
different promoters as well as basic cellular differ- 
ences in regulation of expression and processing 
between monocots and dicots. To date, it is not 
predictable, routine or obvious that a monocot pro- 
moter will operate in a dicot host cell. 

A wide variety of structural genes may be 
introduced into the subject DNA cloning vectors for 
the production of desired proteins, such as en- 
zymes, hormones and the like. In addition. DNA 
constructs of this type can be used for the en- 
hanced production of DNA derived form a particu- 
lar gene, as well as for enhanced production of 
mRNA which can be used to produce cDNA. Such 
vectors carrying specific DNA sequences find wide 
application and are quite versatile; for example, 
they can be used for amplification in bacteria as 
well as for expression in higher cells which allow 
for additional cellular functions. An advantage of 
utilizing higher eukaryotic recombinant systems to 
produce commercially medical and agriculturally 
desirable proteins Is that they ensure correct post- 
translational modifications which may otherwise be 
difficulty to duplicate in prokaryotic and lower 
eukaryotic hosts. 

In this invention the maize ubiquitin promoter 
was shown to function in oat and tobacco, as 
examples of monocots and dicots. respectively, 
and it is conceivable that this promoter can func- 
tion in yet other cells. Such systems include, by 
way of example, and without limitation, other cells 
from which ubiquitin genes have bene Isolated and 
found to be highly conserved, for example, other 
monocots in addition to maize, dicots other than 
tobacco, lower eukaryotic organisms such as yeast 



and mammalian cells. The screening of cellular 
systems suitable for use with the maize ubiquitin 
promoter can be accomplished according to the 
teaching herein, without undue experimentation. 
5 The construction of vectors suitable for the expres- 
sion of a DNA coding segment in individual sys- 
tems has been well documented. Shuttle vectors 
capable of replication in more than one host have 
also been described, for example, shuttle expres- 
10 ston vectors for both yeast and mammalian cells, 
for plants and animal cells and for plants and 
bacterial cells. In addition, it will be understood that 
ubiquitin genes from any other system, that are 
similar to the maize ubiquitin gene in functioning as 
IS a plant promoter, may be employed as the source 
for the ubiquitin promoter sequence. 

The present invention also relates to the utiliza- 
tion of the maize ubiquitin promoter as a heat 
shock promoter. Two heat shock consensus se- 
20 quences are located upstream of the maize 
ubiquitin gene at positions -214 and -204. In many 
eukaryotes, naturally occurring and chemically-syn- 
thesized sequences homologous to the heat shock 
consensus sequence have been shown to regulate 
25 the induction of gene expression. Although the 
ubiquitin promoter contains sequences that are 
Identified as being those of heat shock elements, 
the promoter is distinguished from classical heat 
shock promoters (1) in having a nontranslated in- 
30 tron 3' to the transcription start site and (2) in 
regulating ubiquitin expression constitutively as 
well as inductively. The functional relationship be- 
tween heat shock elements and the presence of a 
large intron within the promoter region is unknown 
35 to prior art. The nucleotide distance between these 
characteristic features and also the directionality 
and orientation of one element with respect to the 
other are presumed in the present work to be 
variable, as long as the basic promoter function of 
40 the derivative regulatory fragments remains active. 

The presence of an intron in the promoter 
region has been related to the relative stability of 
the unprocessed mRNA transcript and, indirectly, 
to the level of protein synthesized (Callls et al. 
45 (1 987) Genes and Development 1 : 1 1 83-1 200). Con- 
stitutively expressed ubiquitin mRNA has been re- 
ported to be maintained at stable levels in chicken 
embryo fibroplasts. whereas ubiquitin mRNA 
formed in response to stress has a half-life of 
50 approximately 1.5 to 2 h. 

In yeast four distinct ubiquitin-coding loci have 
been described. Constitutively expressed ubiquitin 
is encoded by one or more of three of the ubiquitin 
genes, two of which contain an approximately 400 
55 bp Intron immediately within the coding region. The 
fourth ubiquitin gene, devoid of a nontranslated 
intron but comprising multiple heat shock elements, 
functions primarily in inducing ubiquitin expression 
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In response to stress. It has been shown that the 
latter ubiquitin gene does not act constitutively but 
rather is turned on in response to heat shock or 
stress signal (E. Ozkaynak et al. (1987) EMBO J. 
6:1429-1439). 

In maize, ubiquitin is encoded by a small mul- 
tigene family. In this invention is presented the 
nucleotide sequence of one of the ubiquitin genes. 
A large (approximately 1 kb) intron between the 
transcriptional and the transiational start sites as 
well as nucleotide sequences corresponding to 
consensus heat shock sequences are found within 
the maize ubiquitin promoter region. These two 
regions of specialization most probably are in- 
volved in ubiquitin synthesis and in regulating the 
ubiquitin level In response to extemal Influences. 
The functional relationship between the intron and 
the heat shock elements encompassed within the 
ubiquitin promoter is unknown. It is reported in this 
invention that the maize ubiquitin promoter regu- 
lates the synthesis of mRNA both under normal 
and under heat shock conditions and that changes 
In the regulation of transcription account for the 
enhancement in ubiquitin synthesis after heat 
shock. 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXAMPLES 

Example 1: Isolation and Characterization of the 
Maize Ubiquitin Gene 

A. Growth of Plants 

Zea mays Inbred line B73 was grown in moist 
verrniculite for 4 to 5 days at 25 • C in the dark. The 
' portion of the seedlings from the mesocotyl node 
to the shoot tip was harvested, frozen in liquid 
nitrogen and stored at -80 • C. 

B. RNA Isolation and Analysis 

Total cellular RNA was extracted from frozen 
tissue using the guanidlne thiocyanate procedure. 
Poly(A) + RNA was isolated from total cellular RNA 
by passage over a poly U-Sephadex (Bethesda 
Research Laboratories, Gaithersburg, MD) column. 
Total or poly(A)+ RNA was electrophoresed in 
1.5% agarose gels containing 3% (wtA^ol) formal- 
dehyde. RNA was transfen-ed to Gene Screen*™ - 
(DuPont) by capillary blotting using 25 mM sodium 
phosphate (pH6.5). 

Blots were prehybridized in 50% formamide, 
5XSSC, lOOug denatured salmon DNA. 40mM so- 
dium phosphate (pH6.8). 0.5% BSA and 1% SDS. 
Blots were hybridized in 50% formamide. 5XSSC, 
lOOug/ml denatured salmon DNA, 40mM sodium 



phosphate 9pH6.8) and 10% dextran sulfate. 

C. cDNA Library Construction 

s Double stranded cDNA was synthesized from 

poly(A)+ RNA by a modification of the method of 
Murray et al. (1983) Piant Mol. Biol. 2:75-84. Oligo- 
(dC)-tailing of the double-stranded cDNA and an- 
nealing of oligo(dC)-tailed cDNA with oligo(dG)- 

70 tailed pBR322 were performed using standard 
technology. The annealed DNA was transformed 
into E. coli HB101 and plated directly onto nitrocel- 
lulose filters (Millipore, HATF; 0.45 um) on L-agar 
plates containing tetracycline (15 ug/ml). 

15 

D. Identifications of Ubiquitin cDNA 

A number of cDNAs representing potentially 
light-regulated mRNAs were obtained by screening 

20 a cDNA library by differential hybridization. Several 
of these cDNAS were selected and further 
screened by RNA blot analysis to confirm light 
regulation. One cDNA clone, p6R7.2b1. while not 
representing a red-light regulated mRNA, was of 

25 interest because it hybridized with three poly(A) + 
RNAs of different size and abundance. Nick trans- 
lated p6T7.2b1 hybridized strongly with the 2100 
nucleotide and 1600 nucleotide mRNAs, but only 
weakly with the 800 nucleotide transcript. However, 

30 hybridization of Northern blots with a single strand- 
ed 32p-|abeled RNA generated by SP6 polymerase 
transcription of linearized pCA210, a plasmid con- 
structed by subcloning the cDNA insert of 
p6T7.2b1 into pSP64, readily detected all three 

36 transcripts. 

Since RNA-RNA hybrids are known to be more 
thermally stable than DNA-RNA hybrids, single 
stranded RNA probes rather than nick translated 
DNA probes were used in Northern blot hybridiza- 

40 tlons. Again, the 1600 base transcript was found to 
be about 3 fold less abundant than the 21 00 base 
transcript as determined from Northern blots, re- 
gardless of whether the blot was hybridized with 
nick translated DNA or single strand RNA probes. 

45 The smallest transcript was about half as abundant 
as the 2100 base mRNA in blots hybridized with 
RNA probes. 

Restriction fragments were subcloned into 
M13mp18 and/or mp19 and sequenced by the 

50 dideoxynucleotide chain termination method. Ana- 
lysis of the sequence of the clone revealed a single 
long open reading frame of 818 bp terminating in a 
TAA stop codon. The National Biomedical Re- 
search Foundation library was searched using the 
55 D fast P program for protein sequences homolo- 
gous with the deduced amino acid sequence. 
Greater than 95% homology was found between 
the deduced amino acid sequence of the maize 
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cDNA clone and the sequences of bovine and 
human ubiquitin. 

E. Genomic Library Construction and Screening 

High molecular weight maize DNA was isolated 
from frozen maize seedling. DNA was partially di- 
gested with Sau3A, size fractionated and cloned 
into the Bam HI sites of Charon 35 (Loenen et al. 
(1983) Gene 26:171-179). A library of about 
2X10^pfu was screened for recombinant phage 
containing sequences homologous to the ubiquitin 
cDNA clone by in situ plaque hybridization using a 
ubiquitin cDNA clone as a hybridization probe. Re- 
combinant phage were purified from broth lysates 
and phage DNA was isolated using standard tech- 
niques. Restriction endonucfease digestions were 
carried out according to manufacturers' specifica- 
tions. 

F, Genomic Southern Blot Analysis 

Isolated, high molecular weight maize DNA was 
digested with EcoRI. Hindlll and Sad , fractionated 
on 0.7% agarose gels and the DNA fragments were 
transfen-ed to Gene Screen Plus™ (DuPont). Filters 
were prehybridized for 6-8h at 65 'C in 6XSSC 
(1XSSC = 0.1 5M NaCI. 0.025M Na Citrate), 5X 
Denhardt*s medium. 100 iig/ml denatured, sonicat- 
ed Salmon DNA, 20 ug/ml polyadenyllc acid, 10 
mM disodium EDTA and 0.5% SDS. Filters were 
hybridized at 65 • C in fresh buffer with ^p labeled 
plasmid DNA (pCA210). Autoradiography was car- 
ried out at -80 -C using Kodak X-OMAT AR Rim 
and one DuPont Cronex LightningPlus intensifying 
screen. In each digest, 8 to 10 restriction fragments 
hybridized with the nick translated pCA2lO probe, 
suggesting that ubiquitin is coded by a small mul- 
tigene family. Evidence that ubiquitin is encoded 
by a small multigene family has also been reported 
for Xenopus, barley and yeast. 

Two or three fragments in each digest hybridiz- 
ed strongly with the probe, whereas the remainder 
of the fragments hybridized weakly. The differ- 
ences in hybridization intensities may reflect dif- 
ferent sequence homology such that the cDNA 
probe hybridizes preferentially to the gene from 
which it was derived. 

Ubiquitin genes from yeast and Xenopus have 
been characterized and have six and at least 
twelve ubiquitin repeats, respectively. Maize genes 
corresponding to the three transcripts detected on 
Northern blots may have seven, five and one or 
two ubiquitin repeats in the 2.1, 1.6 and 0.8 kb 
mRNAs, respectively. The maize ubiquitin gene 
described in this invention codes for seven repeats. 
Thus, the difference in hybridization intensity ob- 
served on Southern blots may be a result of the 



restriction fragments containing a different number 
of ubiquitin repeats. 

The ubiquitin cDNA clone did not contain 
EcoRl and Hindlll sites. However, the maize 

5 ubiquitin genes may contain introns which are cut 
by the restriction endonucleases used in the 
genomic digests. This could result in ubiquitin ex- 
ons being on different fragments and could account 
for the differential hybridization intensities observed 

10 in the Southern blots. 

G. Ubiquitin Sequence Analysis and Transcription 
Start Site Analysis 

T5 DIdeoxynucleotlde chain termination sequenc- 

ing was performed using Klenow fragments of DNA 
polymerase 1 (Boehringer Mannheim). A 1.85 kb 
PstI fragment of the genomic clone 7.2bl (see 
Figure lb) homologous to the cDNA clone 
20 p6T7.2b.1 and the 2 kb PstI fragment immediately 
upstream, termed AC3#9M13RF, were subcloned 
In both orientations into M13mp19. Recombinant 
phage RF DNA was prepared as for plasmid DNA. 
Unidirectional progressive deletion clones for 
25 sequencing both strands of these PstI fragments 
were prepared. Exonuclease II! and Exonuclease 
VII were obtained from New England Biolabs and 
Bethesda Research Laboratories, respectively. 
Computer analysis of DNA sequences was per- 
30 formed using programs made available by the Uni- 
versity of Wisconsin Genetics Computer Group. 

The transcription start site of the ubiquitin gene 
and the 3' junction of the intron and exon in the 5' 
untranslated region of the gene were determined 
35 by SI nuclease mapping. Fragments suitable for 
S1 probes were prepared as follows. The ubiquitin 
DNA was digested with either Bglll or Xhol. These 
were then labeled with ^^p using -^^P ATP (6000 
Ci/mmol. New England Nuclear, Boston, MA) and 
40 T4 polynucleotide kinase (New ENgland Biolabs). 
Subsequent digestion of the Bgll and Xhol kinased 
fragments with PstI and EcoRI, respectively, gen- 
erated a 946 bp Pstl-Bglll fragment and a 643 bp 
EcoRI-Xhoi fragment- These fragments were sepa- 
45 rated from the other end-labeled fragments by 
electrophoresis through a 5% polyacrylamide gel. 
Slices containing the 946 bp Pstl-Bglll and the 643 
pb EcoRI- Xho l fragments were cut out of the gel 
andihe labeled DNAs were eluted from the gel. 
50 End-labeled DNA fragment (10-20 fmole) was 
hybridized with 2 UQ of poly(A) + RNA in 30 ul of 
buffer containing 80% deionized formamide, 0.4M 
sodium chloride. 40mM PIPES 9pH6.4) and 1mM 
EDTA (pHB.O). The nucleic acid solution was heat- 
55 ed to 80 "C for about 16h. Ice-cold S1 digestion 
buffer (300ul) containing 280mM sodium chloride. 
50mM sodium acetate (pH4.6), 4.5mM zinc sulfate 
and 20 ug/ml single stranded DNA was added and 
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the DNA digested with 250 units/ml of SI nuclease 
(New England Nuclear). The reaction was topped 
with 75ul of SI termination mix containing 2.5M 
ammonium acetate and 50m M EDTA. The products 
of the 81 nuclease digestion were then separated 
on a 6% polyacrylamide/8IS^ urea gel and visual- 
ized by autoradiography. The end points of the SI 
protected fragments in the ubiquitin sequence were 
determined by comparison with a sequence ladder 
generated by Maxam/Gilbert base modification- 
cleavage reactions carried out on the end labeled 
fragments used as SI probes. 

The DNA sequence of the maize ubiquttin-1 
gene, 7.2b1. is shown in Figures 2-1 to 2-7. The 
sequence is composed of 899 bases upstream of 
the transcription start site, 1992 bases of 5' untran- 
slated and intron sequences, and 1999 bases en- 
coding seven ubiquitin protein repeats preceding 
249 bases of 3' sequence. A "TATA" box is lo- 
cated at -30 and two overlapping heat shock ele- 
ments are located at -214 and -240. The DNA 
sequence of the coding and 3' regions of the 
ubiqultln-1 gene from maize. 7.2b1, is also pre- 
sented if Figure 3. The derived amino acid se- 
quence of maize ubiquitin is shown at the top and 
the nucleotide sequence of the seven ubiquitin 
repeats is aligned underneath. A schematic of the 
organization of the seven complete ubiquitin units 
in the genomic DNA is shown in Figure 1C. 

The derived amino acid sequences of all of the 
ubiquitin repeats are identical (Figure 3). The termi- 
nal (seventh) ubiquitin repeat contains an additional 
77th amino acid, glutamine. prior to the TAA stop 
codon. This additional amino acid is not found in 
mature ubiquitin, and is apparently removed during 
processing. The 77th amino acid of the final repeat 
in the human gene is valine, while in the two 
chicken genes, It is tyrosine and asparagine. Yeast 
and barley also have an extra amino acid, as- 
paragine and lysine, respectively; however, an ex- 
tra amino acid was not found In the Xenopus gene. 
This extra amino acid has been proposed to func- 
tion as a block to conjugation of unprocessed 
polyublquitin to target proteins. A polyadenylation 
signal (AATAAT) is present in the 3' untranslated 
sequence, 1 13 bp from the stop codon. 

All seven repeats encode the identical amino 
acid sequence, whereas the nucleotide sequence 
of the repeats varies by as many as 39 
nucleotides. This is similar to what has been re- 
ported for the nucleotide sequence homologies be- 
tween ubiquitin genes. About 80% of the nucleotide 
mismatches between ubiquitin repeats are at the 
third (wobble) base in the codon. Alternate codon 
usage for leucine (5 codons). serine (3 codons) and 
arginine (3 codons) account for the remaining 
nucleotide mismatches. 



The amino acid sequence for maize ubiquitin is 
identical to that determined for two other higher 
plants, oat and barley. The sequence differs from 
the sequence reported for yeast by two amino 

5 acids; alanine for serine substitutions at positions 
28 and 57. The maize sequence is also slightly 
different form that reported for ubiquitin from all 
animals; substitutions by serine for proline at posi- 
tion 19, aspartate for glutamate at position 24 and 

10 alanine for serine at position 57. Thus, based on 
sequence, there appear to be three types of 
ubiquitin: plant, animal and yeast. 

Example 2: Construction of plasmid pUB-CAT com- 
15 prising the maize ubiquitin promoter and a struc- 
tural gene 

A. Promoter isolation and construction of pUB-CAT 

20 The procedure used for construction of the 

ubiquitin gene upstream region-chloramphenicol 
acetyl transferase (CAT) gene fusion is outlined in 
Rgure 4, The BamHI-Hindlll restriction fragment 
containing the CAT gene and the nopaline syn- 

25 thase (NOS) 3' untranslated region and 
polyadenylation signal of pNOS-CAT (Fromm et al. 
(1985) Proc. Natl. Acad. Sci. 82:5824-5828) was 
subcloned into BamHI and Hindlll digested pUCia. 
This construct was termed pUB-CAT. 

30 An approximately 2.0kb PstI fragment imme- 

diately upstream of the ubiquitin polyprotein coding 
region of the maize ubiquitin gene 7.2bi was sub- 
cloned Into M13mp19. This segment of DNA spans 
nucleotides -899 to 1092 of the maize ubiquitin 

35 sequence documented in Figures 2-1 to 2-7. This 
recombinant DNA was termed AC3#9IV113RF and 
contains the ubiquitin promoter, 5' untranslated 
leader sequence and about Ikb intron, labeled OBI- 
S' in Figure 4. 

40 The ubiquitin promoter-CAT reporter gene fu- 

sion was constructed by blunt ending with T* DNA 
polymerase the 2.0kb PstI fragment of 
AC3#9M13RF and cloning this fragment into Sma l- 
dlgested pUC18-CAT. This construct was termed 

45 pUB-CAT. 

B. Introduction of Recombinant DNA into Oat and 
Tobacco Protoplast 

50 Leaves (2g) of 5- to B-day old etiolated oat 

seedlings were finely chopped with a razor blade. 
The tissue was rinsed several times with digestion 
medium (3mM MES. pH5.7, lOmM calcium chlo- 
ride, 0.5M mannitol and 2 mg/ml arginine) and then 

55 incubated for 4h at room temperature with 20 ml 
digestion medium containing 2% cellulase. The 
tissue was shaken occasionally to release proto- 
plasts. The material was filtered through a 63 nm 
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mesh and centrifuged 5 min at 50xg. The super- 
natant fluid was removed and the protoplasts were 
washed two times with digestion medium and then 
resuspended In electroporation buffer to give 0.5 
ml of protoplast suspension per electroporation. 
The electroporation buffer consisted of: lOmM 
HEPES, pH7.2, 150mM sodium chloride, 4mM cal- 
cium chloride and 0.5M mannrtol. 

Protoplasts (0.5ml) in electroporation buffer 
were mixed on ice with 0.5ml of electroporation 
buffer containing 40ug plasmid DNA plus lOOug 
sonicated salmon DNA. The protoplasts were elec- 
troporated on ice with a 350 volt. 70msec pulse. 
The protoplasts were incubated another 10 min on 
ice, then diluted Into 10ml Murasige-Skoog (MS) 
medium and incubated at room temperature for 
24h. 

Protoplasts were pelleted by centrifugation for 
5 min at 50 xg. The supematant fluid was removed 
and the protoplasts washed once with IS^S medium. 
The protoplast pellet was resuspended in 200ul 
Buffer A <0.25M Tris. pH7.8, ImM EDTA. ImM ^- 
mercaptoethanol) and transferred to a microcen- 
trifuge tube. Protoplasts were disrupted by sonica- 
tion for 5-10 sec at the lowest setting. Protoplast 
debris was pelleted by centrifugation for 5 min at 
4 • C. The supernatant fluid was removed, heated to 
65 • C for 1 0 min and stored at -20 • C. 

C. Assay for CAT activity In transformed proto- 
plasts 

Aliquots (IOOu.1) of the electroporated proto- 
plast extract (extract of cells transformed with re- 
combinant DNA) were added to 80ul of Buffer A 
and 20ul of a mix of 20ul ^♦C-chloramphenicol (40- 
60 mCi/mM), 2mg acetyl CoA and 230ul Buffer A. 
The reaction was incubated for 90 min at 37 'C. 
The reaction products were extracted with BOOul 
ethyl acetate and were concentrated by evaporat- 
ing the ethyl acetate and resuspending in lOul 
ethyl acetate. The reaction products were sepa- 
rated by thin layer chromatography using chlo- 
roform:methanol (95:5.v/v) solvent and were de- 
tected by autoradiography. 

Transformation of host cells was determined by 
measuring the amount of enzymatic activity ex- 
pressed by the structural gene contained within the 
promoter-gene fusion construct. In this example, 
the structural gene encoding chloramphenicol ace- 
tyl transferase was employed in the DNA construct. 
To test the efficacy of the promoter utilized in the 
recombinant DNA fusion construct, parallel elec- 
troporations were carried out, utilizing either the 
maize ubiqultin promoter-CAT gene fusion pUC- 
CAT (described herein and In Figure 4) and 
pCaMV-CAT, a cauliflower mosaic virus 35S pro- 
moter-CAT gene fusion (Fromm et al. (1985) Proc. 



Natl. Acad. Sci. USA 82:5824-5828) obtained from 
V. Walbot, Stanford University. As illustrated in 
Figure 5, in oat protoplasts the ubiquitin promoter 
is "stronger" than the CaMV promoter, as judged 
5 by the amount of enzymatic activity expressed. 

Example 3: Heat Shocic Response 

A. Heat Shock Treatment 

10 

To heat shock, 4- to 5-day old etiolated seed- 
lings were transferred to an incubator at 42 • C and 
harvested 1. 3 and 8h after transfer. Total RNA 
(7ug) was isolated, denatured and electrophoresed 

15 through a 1.5% agarose 3% formaldehyde gel. The 
RNA was transferred to Gene Screen and probed 
with single stranded RNA transcribed from linear- 
ized pCA210 using SP6 RNA polymerase. (The 
recombinant plasmid. pCA210, was constructed by 

20 subcloning the 975 bp insert of p6R7.2b1 into 
pSP64 (Promega) so that SP6 RNA polymerase 
synthesized an RNA probe specific for hybridiza- 
tion with ubiquitin mRNA.) After autoradiography, 
the bands were cut out and the amount of radioac- 

25 tivity bound to the filter was determined by liquid 
scintillation. From analysis of the Northern blots, 
levels of three ubiquitin transcripts were deter- 
mined. 

One hour after transfer to 42 ' C, the level of 
30 the 2.1 kb transcript increased 2.5 to 3 fold. An 
approximately 2 fold increase was observed for the 
1.6kb transcript, however, no increase was seen for 
the 0.8kb transcript. By three hours after transfer of 
the seedlings to elevated temperature, the levels of 
35 the two largest ubiquitin transcripts had returned to 
the level observed in unshocked tissue and re- 
mained at those levels for at least another five 
hours. The transitory nature of ubiquitin during the 
heat shock response in maize may indicate the 
40 ubiquitin has a specialized role in heat shock and 
that only brief periods of Increased levels of 
ubiquitin are required. 

B. Heat Shock Sequences 

45 

The nucleotide sequence of the maize ubiquitin 
gene is presented in Figures 2-1 to 2-7. Within the 
promoter region are nucleotide sequences homolo- 
gous to the consensus heat shock sequence that 
50 has been shown to confer stress inducibility when 
placed upstream of heterologous promoters (Pel- 
ham (1982) supra ). The consensus sequence for 
the Drosophila heat shock element is 

5'-CTGGAAT__TTCTAGA-3' 
55 and is generally found approximately 26 bases 
upstream of the transcriptional start site. 

Located within 900 bases 5* to the transcrip- 
tional start site of the maize ubiquitin promoter are 
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two overlapping heat shock sequences: 

5'-CTGGA CCCCTCTCGA-3' starting at 
nucleotide -214, and 

5*-CTCGA GAGTTCCGCT-3' starting at 
nucleotide -240. The ubiquitin promoter from chick- 
en embryo fibroblasts was also found to contain 
two overlapping heat shock consensus promoter 
sequences: 

5*-CTCGA ATCTTCCAG-3' starting at 

nucleotide -369, and 

5'-CCAGA GCTTTCTTTT-3' starting at 
nucleotide -359. The 5' flanking region of the yeast 
ubiquitin gene UB14 (E. Ozkaynak et al. (1987) 
supra ) comprises an 18 kb, rotationally symmetric 
(palindromic) sequence. 5'-TT CTAGAACGTTC- 
TAGAA-3'. 365 teases upstream of the translation 
start site. The middle 14 bases (underlined) of this 
18 bp sequence contain an exact homology to the 
rotationally symmetric consensus *heat shock box' 
nucleotide sequence starting at approximately 284 
nucleotides upstream of the presumed transcription 
start site. 

The relative position of the heat shock se- 
quence with respect to the transcriptional initiation 
codon and its ultimate consequence on the mag- 
nitude of the induction response to heat shock or 
other stress remains largely unknown, although it 
has been suggested (U. Bond et al. (1986) supra ) 
that the further a heat shock element is located 5' 
from the transcriptional start site, the smaller is the 
level of Induction in response to stress. 

In this invention it is assumed that a heat shock 
sequence may be arbitrarily positioned at different 
loci within the ubiquitin promoter and that It may be 
chemically altered in sequence or be replaced with 
a synthetic homologous sequence, so long as the 
modified promoter sequence retains ubiquitin pro- 
moter function, which comprises the initiation, di- 
rection and regulation of transcription under stress 
and non-stress conditions. Biochemical techniques 
required to insert and/or delete nucleotides and to 
manipulate the resultant DNA fragments are well- 
known to those skilled in the art of genetic redesig- 
ning. 

Example 4: Presence of Heat Shock Sequence(s) 
and a Large Intron Within the Ubiquitin Promoter 

The ubiquitin promoter from maize Is char- 
acterized structurally by the presence of two over- 
lapping heat shock sequences approximately two 
hundred bp upstream of the transcriptional start 
site and that of a large (approximately Ikb) intron 
between the transcriptional start site and the tran- 
slational initiation codon. This promoter structure is 
very similar to that reported (U. Bond et al. (1986) 
supra) for the ubiquitin promoter from chicken em- 
bryo" fibroblasts in which two overlapping heat 



shock sequences are located approximately 350 bp 
upstream of the transcriptional start site and a 674 
bp intron is contained between the transcriptional 
and translational initiation codons. Recently (E. Oz- 
5 kaynak et al. 91987) supra ), the nucleotide se- 
quence of the promoter region from yeast ubiquitin 
UB14 gene was determined and found to contain a 
heat shock sequence approximately 280 bp up- 
stream of the transcriptional start site, but this 
10 yeast ubiquitin promoter was devoid of a large 
intron between the transcription and translation initi- 
ation sites. However, two other yeast ubiquitin 
genes, which did contain introns, were found to be 
lacking sequences homologous to the Pelham 
/5 "heat shock box" sequence, 

Ubiquitin promoters have been shown to up- 
regulate expression of ubiquitin in response to heat 
shock in yeast, chicken embryo fibroblasts and 
maize. In all three systems, the level of ubiquitin 
20 mRNA is elevated after heat shock treatment and 
the Increase in ubiquitin level was determined in 
maize and chicken embryo fibroblasts to be ap- 
proximately 3 fold. This enhancement in ubiquitin 
expression in response to heat shock is signifi- 
25 cantly less than that obtained with other heat shock 
genes. It was found in chicken embryo fibroblasts 
tfiat the levels of ubiquitin mRNA in cells exposed 
to 45 'C increase by 2.5 fold over a 2.5h period, 
whereas the levels of HSP70 mRNA Increased 10 
30 fold under the same heat shock conditions. More- 
over, the relative instability of ubiquitin mRNA dur- 
ing recovery of cells from a 3h heat shock (half-life 
of approximately 1.5 to 2h) was also found to differ 
significantly from that of HSP70 mRNAs which 
35 were found to be stable. 

It is interesting to note that in contrast to 
ubiquitin promoters, HSP70 genes do not contain 
large introns between the transcriptional and tran- 
slational initiation codons. Another difference be- 
40 tween the ubiquitin promoter and other heat shock 
promoters is that ubiquitin is expressed both con- 
stitutively and inductively, whereas expression of 
classical heat shock proteins occurs predominantly 
In response to heat shock or other stress. This 
45 Invention allows skilled workers knowledgeable in 
ttio art to modify ubiquitin promoter with respect to 
the composition sequence and position of both the 
Intron and the heat shock sequences In order to 
alter constitutive and/or inductive expression of 
50 ubiquitin. Also, standard recombinant technology 
may be employed to reposition, as well as to 
chemically alter the nucleotide sequences within 
the maize ubiquitin promoter region in such a fash- 
ion as to retain or Improve the promoter function of 
55 the resultant modified DNA. Testing for ubiquitin 
promoter function may be carried out as taught in 
Example 2. 
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Claims 



1. 



2. 



3. 



A DNA sequence no larger than 2 kb, said 
DNA sequence comprising a plant ubiqultin 
regulatory system, wherein said regulatory 
system contains a heat shock element and an 
intron. 

A DNA sequence as in claim 1 wherein said 
heat shock element is at least 75% homolo- 
gous to the heat shock consensus sequence. 

A DNA sequence as in claim 1 wherein said 
heat shock element is not more than approxi- 
mately 214 nucleotides 5' of the transcription 
start site. 



4. 



5. 



6. 



7. 



B. 



9. 



A DNA sequence as in claim 1 wherein said 
intron is approximately 1 kb in length. 

A DNA sequence as in claim 6 wherein said 
Intron is that of the maize ubiquitin promoter. 

A DNA sequence as in claim 6 wherein said 
intron is located not more than 83 nucleotides 
3' of the transcription start site. 

A DNA sequence as in claim 1 wherein said 
heat shock element is located upstream of the 
transcription start site and wherein said intron 
is located downstream of the transcription start 
site. 



10. A DNA construct comprising: 

(a) a DNA sequence no larger than 2 kb, 
said DNA sequence comprising a plant 
ubiquitin regulatory system, wherein said 
regulatory system contains a heat shock 
element and an intron, and 

(b) a plant-expressible structural gene 
wherein said structural gene is placed under 
the regulatory control of said plant ubiquitin 
regulatory system. 

11. A DNA construct as in claim 10 wherein said 
structural gene codes for a gene that is not 
under heat shock control in nature. 

12. A method for the constitutive expression of a 
structural gene and the selected stress-in- 



10 



IS 



duced enhancement in expression of said 
structural gene in a plant ceil comprising the 
steps of: 

(a) transforming said plant cell with a DNA 
construct comprising a plant ubiquitin regu- 
latory system, wherein Is found a heat 
shock element and an intron, and a plant- 
expressible structural gene that is under the 
regulatory control of said plant ubiquitin reg- 
ulatory system, and 

(b) selectively applying stress conditions to 
said transformed plant cell thereby inducing 
enhancement in expression of said struc- 
tural gene. 

Patentanspriiche 



A DNA sequence as in claim 3 wherein said 
regulatory system contains two heat shock ele- 
ments. 

A DNA sequence as In claim 4. wherein said 
heat shock elements are situated so as to 
overlap. 



1. 



2. 



25 



3. 



30 



4. 



35 



5. 



40 



6. 



DNA-Sequenz, nicht langer als 2 kb. umfas- 
send ein Pflanzen-Ubiquitin-Regulatorsystem. 
worin das Regulatorsystem ein Hitzeschock- 
Element und ein Intron enthalt. 

DNA-Sequenz nach Anspruch 1 . worin das Hit- 
zeschock-Element eine mindestens 75%ige 
Homologie zu der Hitzeschock-Consensus-Se- 
quenz aufweist. 

DNA-Sequenz nach Anspruch 1 , worin das Hit- 
zeschock-Element nicht mehr als etwa 21 4 Nu- 
kleotide in 5'-Richtung von der Transkriptions- 
startstelle entfernt ist. 

DNA-Sequenz nach Anspruch 3, worin das Re- 
gulatorsystem zwel Hitzeschock-Elemente ent- 
halt. 

DNA-Sequenz nach Anspruch 4, worin die Hit- 
zeschock-Elemente derart angebracht sind, 
daB sie sich uberiappen. 

DNA-Sequenz nach Anspruch 1. worin das In- 
tron eine Lange von etwa 1 kb hat. 



45 



50 



55 



7. DNA-Sequenz nach Anspruch 6, worin das In- 
tron jenes des Mais-Ubiqultin-Promotors ist. 

8. DNA-Sequenz nach Anspruch 6, worin das In- 
tron nicht mehr als 83 Nukleotlde In 3'-Rich- 
lung von der Transkriptionsstartstelle entfernt 
ist. 

9. DNA-Sequenz nach Anspruch 1 , worin das Hit- 
zeschock-Element upstream der Transkrip- 
tionsstartstelle lokalisiert ist und das Intron 
downstream der Transkriptionsstartstelle lokali- 
siert ist. 

10. DNA-Konstrukt, umfassend: 
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(a) DNA-Sequen2, nicht !anger als 2 kb, 
umfassend ©in Pflanzen-UbiquUin-Regulator- 
system, worin das Regulatorsystem ein Hit- 
zeschock-Element und eIn Intron enthSlt. 
und 

(b) Pflanzen-exprimierbares Strukturgen, 
worin das Strukturgen unter die regulatori- 
sche Kontrolle des Pflanzen-Ubiquitin-Regu- 
latorsystems gestellt ist. 

11. DNA-Konstmkt nach Anspruch 10. worin das 
Strukturgen fUr eIn Gen codiert, das in der 
Natur nicht unter einer Hitzeschock-Kontrolle 
steht. 

12. Verfahren zur konstitutiven Expression eines 
Strukturgens und zur selektierten StreB-indu- 
zierten Expressionseriiohung des Strukturgens 
in einer Pflanzenzelle. umfassend die folgen- 

den Sciiritte: 

(a) Transfomnation der Pfianzenzelle mit ei- 
nem DNA-Konstrukt, umfassend ein l^lan- 
zen-Ubiquitin-Regulatorsystem, worin ein 
Hitzeschock-Element und ein Intron vorliegt. 
und ein Pflanzenexprimierbares Strukturgen, 
das unter der regulatorisclien Kontrolle des 
Pflanzen-Ubiquitin-Rogulatorsystoms steht, 

und 

(b) selektive Anwendung von StreBbedin- 
gungen auf die transformierte Pflanzenzelle, 
wodurch ©ine Expressionserhohung des 
Strukturgens induziert wird. 

Revendications 

1. Sequence d'ADN d© longueur non sup6rieure 
h 2 kb, cette sequence d'ADN comprenant un 
system© r^gulateur de Tubiquitine veg^tale. co 
systfeme r^gulateur contenant un §l6ment de 
choc thermique et un intron. 

2. Sequence d'ADN selon la revendication 1, 
dans laquelle I'^l^ment de choc thermique pr6- 
sente une homologio d'au moins 75% avec la 
sequence consensus du choc thermique. 

3. Sequence d'ADN selon la revendication 1, 
dans laquelle r6l6ment de choc thermique 
n'est pas 6loigne du site d'initiation de la trans- 
cription de plus de 214 nucleotides en 5' envi- 
ron. 

4. Sequence d'ADN selon la revendication 3, 
dans laquelle le systfeme regulateur contient 
deux 6l6ments de choc thermique. 

5. Sequence d'ADN selon la revendication 4. 
dans laquelle les 6l6ments de choc thermique 



TO 



sont situ6s de fagon a etre en chevauchement. 

6. Sequence d'ADN selon la revendication 1, 
dans laquelle rintron a une longueur d'environ 
1 kb. 

7. Sequence d'ADN selon la revendication 6, 
dans laquelle I'intron est celui du promoteur de 
Tubiquitine du maYs. 

a Sequence d'ADN selon la revendication 6, 
dans laquelle I'intron est situe a une distance 
du site d'initiation de la transcription non supe- 
rieure a B3 nucleotides en 3\ 



15 



9. Sequence d'ADN selon la revendication 1. 
dans laquelle I'element de choc thermique est 
situ4 en amont du site d'initiation de la trans- 
cription, rintron 6tant situ6 en aval du site 

20 d'initiation de la transcription. 

10. Construction d'ADN comprenant : 

(a) une sequence d'ADN de longueur non 
sup^rieure a 2kb FR/EU 55491 E cette se- 

25 quence d'ADN comprenant un systeme re- 

gulateur de I'ubiquitine vegetale. le systfeme 
regulateur contenant un element de choc 
thermique et un intron, et 
b) un gbne de structure, pouvant etre expri- 

30 me par un vegetal, le gene de structure 

etant place sous le controle regulateur du 
systfeme regulateur de I'ubiquitine vegetale. 

11. Construction d'ADN selon la revendication 10, 
35 dans laquelle le gene de structure code pour 

un g^ne qui ne se trouve pas sous le controle 
d'un choc thermique dans la nature. 

12. Procede d'expression constitutive d'un gene 
40 de structure et d'am^lioration. Induite par une 

presslon de selection, de I'expression de ce 
gen© d© structure dans une cellule vegetale, 
qui comprend les etapes consistant : 

(a) h transformer la cellule vegetale, h I'aide 
45 d'un© construction d'ADN comprenant un 

systeme regulateur de I'ubiquitine vegetale. 
dans lequel se trouve un element de choc 
thermique et un intron, et un gfene de struc- 
ture pouvant §tre exprime par le vegetal, se 
50 trouvant sous le controle regulateur du sys- 

teme regulateur de I'ubiquitine vegetale. et 

(b) a appliquer selectivement des conditions 
de pression k la cellule vegetale transfor- 
mee, d© fagon a induire une amelioration de 

55 I'expression du gfene de structure. 
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@ Plant ubiqultin promoter system. 

@ A DNA segment from the upstream untranscrlbed region 
or a maize ubiqultin gene Is disclosed. This ubiqultin promoter 
region, which comprises heat shock consensus elements. 
Initiates and regulates the transcription of genes placed under 
Its control. Recombinant DNA molecules are also described in 
which a ubiqultin promoter Is combined with a plant expressible 
structural gone for regulated expression of the structural gene 
and for regulated control of expression when stressed with 
elevated temperatures. Such recombinant DNA molecules are 
Introduced into, plant tissue so that the promoter/structural 
gene combination is expressed. 
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Description 



PLANT UBIQUmN PROMOTER SYSTEM 



FIELD OF THE INVENTION 

The Invention Is in the area of plant molecular 
biology and concerns plant genetic engineering by 
recombinant DNA technology. The Identification and 
characterization of a segment of DNA from the 
upstream nontranscribed region of a plant ubiquttin 
gene are described. This segment Is capable of 
Initiating and driving the transcription of nearby plant 
expressible genes In recombinant DNA-containing 
tissue from both monocotyledonous and dicotyle- 
donous plants. The described DNA segment wit! 
enable the selective expression and regulation of 
desired structural genes in plant tissue. 

BACKGROUND OF THE INVENTION 

Ubiquitin Is an 8.5 kDa protein found in eukaryotic 
cells in either the free, monomerlc state or covalentiy 
joined to various cytoplasmic, nuclear or membrane 
proteins. This protein contains 76 amino acid 
residues and its amino acid sequence is conserved 
to an unusually high extent. The sequence of 
ubiquitin is identical between species as diverse as 
human, cow, Mediterranean fruit fly. Xenopus and 
chlci<en (U. Bond and M. Schlesinger (1986) MoL 
Ceil. Biol. 5:949-958). Yeast and human ubiquitin 
differ by only three different amino acids (K. 
Ozkaynal^ et al. (1984) Nature 312: 663-666). while 
plant ubiquitin differs from that of yeast by two amino 
acids. Based on this two or three amino acid 
difference in sequence, there appear to be at least 3 
types of ubiquitin - animal, plant and yeast. 

Ubiquitin is found in three major cellular compart- 
ments -the cytoplasmic membrane, the cytoplasm 
and the nucleus. This protein Is required for 
ATP-dependent degradation of Intracellular proteins, 
a non-lysosomal pathway to eliminate from the cell 
those proteins that are damaged or abnormal as well 
as normal proteins having a short half-life (A, 
Hershko et al. (1984) Proc. Natl. Acad. Sci. USA 
81 :1619-1623: D. FInley et al, (1985) Trends Biol. ScL 
10:343-347). Ubiquitin binds to a target protein, 
tagging it for degradation. The covalent attachment 
is through Isopeptlde linkages between the carbox- 
yltermlnus (glycine) in ubiquitin and the e-amino 
group of lysyl side chains in the target proteins. 

Ubiquitin also plays a role in the cellular response 
to stresses, such as heat shock and increase in 
metal (arsenlte) concentration (D. FInley et al. (1985) 
supra ). Most living cells respond to stress (for 
example, exposure to temperatures a few degrees 
above normal physiological temperatures, or to 
elevated concentrations of heavy metals, ethanol, 
oxidants and amino acid analogs) by activating a 
small set of genes to selectively synthesize stress 
proteins, also called heat shock proteins. In most 
organisms these stress proteins were found to have 
subunit molecular weights of 89, 70 and 23 kDa (U. 
Bond and M. Schlesinger (1985) supra) . Ubiquitin, 
with a molecular weight of approximately 8.5 kDa. 
also responds to stress, since in different species 



(yeast, mouse, gerbil and chicken embryo fibro- 
blasts) the levels of ubiquitin mRNA and ubiquitin 
5 protein increase as a result of different stress 
conditions. 

In eukaryotic systems the expression of genes is 
directed by a region of the DNA sequence called the 
promoter. In general, the promoter is considered to 
10 be that portion of the DNA. upstream from the 
coding region, that contains the binding site for RNA 
polymerase tl and initiates transcription of the DNA. 
The promoter region also comprises other elements 
that act as regulators of gene expression. These 
15 include a TATA box consensus sequence in the 
vicinity of about -30, and often a CAAT box 
consensus sequence at about -75 bp 5' relative to 
the transcription state site, or cap site, which Is 
defined as +1 (R. Breathnach and P. Chambon 
20 (1981) Ann. Rev. Biochem. 50:349-383; J. I^essing et 
al (1983) in Genetic Engineering of Plants , eds. T. 
Kosuge, CP. Meredith and A. Hollaender. 
pp. 211-227). In plants the CAAT box may be 
substituted by the AGGA box (J, Messing et al. 
25 (1983) supra ). Other regulatory elements that may 
be pres'entare those that affect gene expression in 
response to environmental stimuli, such as illumina- 
tion or nutrient availability, or to adverse conditions, 
such as heat shock, anaeroblosis or the presence of 
30 heavy metal. In addition, there may be present DNA 
sequences which control gene expression during 
development, or In a tissue-specific fashion. Other 
regulatory elements that have been found are the 
enhancers (in animal systems) or the upstream 
35 acthfating sequences (in yeast), that act to elevate 
the overall expression of nearby genes In a manner 
that is Independent of position and orientation with 
respect to the nearby gene. Sequences homologous 
to the animal enhancer core consensus sequence. 
40 5'-GGTGTGGAAA(orTTT)G-3', have been descnbod 
in plants, for example. In the pea iegumin gene at 
about position -180 relative to the transcription state 
site (G. Lycett et al. (1984) Nucleic Acids Res, 
12*4493-4506) and in the maize Adh1 and Adhg 
45 p'nes at about -200 and -170 bp. respectively, from 
the transcription state site. In general, promoters are 
found 5'. or upstream, relative to the start of the 
coding region of the con-esponding gene and this 
promoter region, comprising all the ancillary regula- 
50 tory elements, may contain between 100 and 1000 or 
more nucleotides. 

Of the regulatory elements controlling gene 
expression, the heat shock element Is perhaps one 
of the most widely studied. Although the universality 
55 of cellular response to heat shock has been known 
for almost a decade, very little Is known yet about the 
function of the heat shock proteins selectively 
synthesized by the stressed cell. The Induction of 
stress protein synthesis occurs at a iranscriptlon 
50 level and the response has been found to be similar 
in bacteria, fungi, Insects and mammals (E. Craig 
(1985) CTC Crit. Rev. Biochem. 18:239-280). In 
addition to the synthesis and accumulation of the 
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Classic heat shock proteins In response to stress, 
cells that are stressed also synthesize proteases 
and ubiquitin. In E. coll , a 94 kDa enzyme that has an 
ATP-dependent proteolytic activity Is encoded by 
the Ion ( cap R) gene whose expression is under 
controT of the heat shock regulon (E. Ozkaynak et al. 
(1984) Nature 312:663-666). In chicken embryo 
fibroblasts {U. Bond et al. (1985) Mol, Cell. Biol. 
5:949-956) the ubiquitin mRNA level increased five 
Fold after heat shock or after exposure to 50 ^iM 
arsenlte. Each mRNA comprises tandemly repeated 
identical polypeptides which, upon translation as a 
polyublquitin molecule, gives rise to multiple ubi- 
quitin molecules, offering a distinctive mechanism 
for amplifying genetic information. This elevated level 
of ubiquitin mRNA does not persist during the 
recovery phase after heat shock, Indicating a 
transient role for free ubiquitin during the stress 

response. /^noo^ 
It has been postulated (J. Anathan et al. (1986) 
Science 232:522-524) that metabolic stresses that 
trigger the activation of heat shock protein genes act 
through a common mechanism. The metabolic 
stresses that activate heat shock genes cause 
denaturation of Intracellular proteins; the accunr»ula- 
tlon of abnormal proteins acts as a signal to activate 
heat shock genes. A role for ubiquitin In targeting 
abnormal proteins for degradation, as well as for 
different proteolytic enzymes, would be compatible 
with such a model of heat shock protein gene 
regulation. 

Most of the early work on heat shock genes was 
done with Drosophila species. In particular, the 
Drosophila hsp70 gene was used widely In recombi- 
nant studies. In homologous systems, the Droso- 
phila hsp70 gene was fused to the E. coil p-galacto- 
sidase structural gene to allow the activity of the 
hybrid gene to be distinguished from the five 
resident hsp70 heat shock genes in the recipient 
Drosophila. Drosophila heat shock genes were also 
Introduced Into heterologous systems, e.g., in 
monkey COS cells and mouse cells (H. Pelham 
(1982) Cell 30:517-628). Regulation by heat shock 
was observed m the hybrid hsp70-!ac Z gene which 
was integrated into the Drosophila germ line and into 
which a 7 kb E.coli p-galactosidase DNA fragment 
was Inserted Into the middle of the hsp70 structural 
gene. The resultant p-galactosidase activity In the 
transformants was shown (J. Lis el al, (1983) Cell 
35:403-410) to be regulated by heat shock. 

^The DNA sequence conferring heat shock re- 
sponse was Identified by deletion analysis of the 
Drosophila hsp70 heat shock promoter to be 
5'-CTGGAAT-_TrCtAGA-3' (H. Pelham et al, (1982) 
in Heat Shock From Bacteria to Man , Cold Spring 
Harbor Laboratory, pp. 43-48) and is generally 
located In the -66 through -47 region of the gene or 
approximately 26 bases upstream of the TATA box. It 
was further demonstrated that a chemically syn- 
thesized copy of this element, when placed up- 
stream of the TATA box of the herpes virus 
thymidine kinase gene in place of the normal 
upstream promoter element, was sufficient to confer 
heat Induclblilty upon the thymidine kinase gene In 
monkey COS cells and in Xenopus oocytes. (The 



thymidine kinase gene is normally not heat induc- 
ible.) These heat shock sequences Interact with heat 
shock specific transcription factor(s) which allow 
the Induction of heat shock proteins (C. Parker et al. 
5 (1984) Ceil 37:273-283). Inducers of heat shock 
genes could be factors that alter (decrease) the 
concentration of heat shock proteins within the cell 
and, thus, control the transcription and translation of 
heat shock genes. 
10 In higher plants, the stress response was demon- 
strated by increased protein synthesis in response 
to heat shock In soybean, pea, millet, corn, sun- 
flower, cotton and wheat (T. Bamett et al. (1980) 
Dev. Genet. 1 :331 -340; J. Key et al. (1981 ) Proc. Natl. 
15 Acad. Sci. USA 78:3526-3530). The major differen- 
ces In heat shock response seen among plant 
species are: (a) the amount of total protein syn- 
thesized in response to stress, (b) the size distribu- 
tion of the different proteins synthesized, (c) the 
20 optimum temperature of induction of heat shock 
proteins and (d) the lethal (breakpoint) temperature. 
High molecular weight proteins are found to be 
electrophoretlcally similar among different species. 
The low molecular weight (15-27 kDa) heat shock 
25 proteins show more electrophoretic heterogeneity 
between species. In plants, the higher molecular 
weight proteins resemble those produced in Droso- 
phila. There Is a marked difference, however, in the 
complexity of the low molecular weight heat shock 
30 proteins between plants and Drosophila. Four heat 
shock proteins. 22. 34, 36 and 27 kDa. are 
synthesized In Drosophila. whereas soybean pro- 
duces over 20 heat shock proteins having molecular 
weights in the range of 1 5-1 8 kDa. The low molecular 
35 weight protein genes in soybeans are the most 
actively expressed and coordinately regulated genes 
under heat shock conditions (F. SohoffI et al. (1982) 
J. Mol. Appl. Genet. ^:3Q^'Z^4). 

Key et al. (EPO Application No. 85302593.0. filed 
40 April 12, 1985) have studied the promoter region of 
plant heat shock genes. Four soybean heat shock 
genes (three genes coding for 15-18 kDa heat shock 
genes (three genes coding for 1 5-1 8 kDa heat shock 
proteins and one genes coding for a 17.3 kDa heat 
45 shock protein) were cloned and sequenced. The 
coding sequences and flanking sequences of the 
four heat shock genes were determined. The 
promoter regions of these four genes were sub- 
cloned, linked to a T-DNA shuttle vector and 
50 transfen-ed into A^robacterium tumefaciens . One of 
the recombinant clones of a soybean heat shock 
gene coding for a 15-18 kDa protein contained an 
open reading frame of 462 nucleotides and a 291 
nucleotide promoter region upstream of the ATG 
55 translation Initiation codon. The promoter Included 
the TATA box, the CAAT box. the transcription 
initiation site and a heat shock consensus sequence 
131-144 nucleotides upstream of the ATG translation 
start codon with the sequence 
60 5'-CT_GAA_TTC__AG-3'. Only three of the four 
clones showed substantial homology In the pro- 
moter region, but there were strong similarities 
between the heat shock consensus sequences of all 
four clones. Significantly, the coding sequence, the 
65 upstream promoter region and the downstream 
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flanking region of the four soybean heat shock 
genes had almost no resemblance to the corre- 
sponding regions of Drosophlla heat shock genes. 
Although there were similarities between the con- 
sensus sequence of the promoter region from 
Drosophlla and soybean heat shock genes, the 
promoter regions of soybean heat shock genes did 
not possess the inverted repeat sequences charac- 
teristic of Drosophiia genes. 

The promoter region from the soybean heat shock 
genes was used to activate a soybean gene and a 
foreign gene (one normally not found In soybean) 
and to show regulation of the response by stress 
(Key et ai.. EPO Application No. 85302593.0, filed 
April 12. 1985), The promoter was isolated from the 
soybean SB 13 heat shock gene as a DNA fragment 
extending 65 bp downstream from the start of 
transcription to Include a major portion of the 
untranslated leader sequence but not the start 
codon for translation. A P-galactosidase gene was 
placed under the control of the heat shock promoter 
within the T-DNA of the Tl-plasm!d In a stable form 
within A. tumefaclens , and then was transfen-ed to a 
plant or plant cell culture. The actuality of DNA 
transfer was recognized by the expression of the 
P-galactosidase gene as the production of a blue 
polor after heat treatment In a medium containing 
the 5-bromo-4-chlbro-3-indolyl-p-D-gaIactosldase 
substrate molecule (M. Rose et al. (1981) Proc. Natl. 
Acad, Scl. USA 78:2460-2464). 

Experimentation with cross expression wherein a 
gene from one plant species Is examined for 
expression in a different species adds a further 
dimension to the understanding of specific function. 
These experiments may embody the Insertion of a 
gene under the control of Its own promoter or of a 
gene artificially fused to a different or unnatural 
promoter. In 1983 Mural et al. (Science 222:476-482) 
obtained expression of the phaseolin gene from 
Phaseolus vulgaris L. In sunflower (Helianthus) 
tissue under two sets of conditions: (i) when the 
Phaseolin gene was under the control of Its own 
promoter and (ii) when the gene was spliced to, and 
under the control of a T-DNA promoter. In subse- 
quent experiments It was shown that the phaseolin 
structural gene under the control of its natural 
promoter could be expressed in tobacco and that 
the tissue-specific expression in the heterologous 
host (tobacco) was similar to that in the native host 
(bean) (C. Sengupta-Gopalen et al. (1985) Proc. 
Natl. Acad. Scl. USA 82:3320-3324), 

In later experiments (J. Jones et al. (1985) EMBO 
J. 4:2411-2418) the expression of the octopine 
synThetase gene (ocs) was described In both 
regenerated transformed homologous (petunia) and 
heterologous (tobacco) plants. In this study the ocs 
gene was fused to the promoter of a petunia 
chlorophyll a/b binding protein. Cross-expression 
was obtained by W. Gurley et al. (1986) Mol. Cell. 
Biol. 6:559-565; and Key et al.. EPO Application 
No. 85302593.0. filed April 12, 1985, who reported 
strong transcription In sunflower tumor tissue of a 
soybean heat shock gene under control of its own 
promoter. In this case functional activity was 
measured as the correct thermal Induction re- 



sponse. 

The first evidence for transcription initiated from a 
monocotyledon promoter in a dicotyledon host plant 
was published by Matzke et ai. (1984) EMBO J- 
5 3-1525-1531. These workers cloned the maize zein 
Z4 gene and Introduced it on aTl-derived vector into 
sunflower stemlets. The ensuing zein mRNA could 
then be translated in a wheat germ system but not in 
the transformed sunflower call!. 
10 In a later study the vrheat gene whAB1.6 encoding 
the major chlorophyll a/b binding protein was cloned 
Into a T-DNA-containIng vector and Iransfenred to 
both petunia and tobacco (G. I^mppa et al. (1985) 
Nature 316:750-752). Expression was obtained In 
16 both the monocotyledon and dicotyledon hosts and 
was determined to be light-induced and tissue-spe- 
cific. In a more recent study, Rochester et al, (1986) 
EMBO J. 5:451-458) obtained expression of the 
maize heat shock hsp70 gene in transgenic petunia, 
20 The maize hsp70 mRNA was synthesized only in 
response to thermal stress. So far. these three 
studies constitute the total number of published 
reports desorlbing successful expression of mon- 
ocot genes In transgenic dicot plants. However, 
25 there are also negative reports describing minimal or 
no expression of maize alcohol dehydrogenase gene 
in tobacco hosts (Llewellyn et al (1985) in Molecular 
F orm and Function of the Plant Genome . L. van 
Vloten-Dotlng. G.S. Groot and T. Hall (eds.). Plenum 
30 Publishing Corp., pp. 593-608; J.Q. Bills et al. (1987) 
EMBO J. 6:11-16), suggesting a possible Inherent 
species-specific difference between monocot and 
dicot promoters. . 
The heat shock response is believed to provide 
35 thermal protection or thermotolerance to othenvise 
nonpermissive temperatures (M. Schlesinger et al. 
(1982) in Heat Shock from Bacteria to Man , Cold 
Spring Harbor Laboratory. Cold Spring Harbor. New 
York p 329). A permissive heat shock temperature 
40 is a temperature which Is high enough to induce the 
heat shock response but not high enough to be 
lethal. Thermotolerance in plant seedlings can be 
attained by different treatment regimes: (a) a 1 to 2 
hour exposure to continuous heat shock at 40° C 
45 followed by a 46'* C incubation, (b) a 30 minute heat 
shock at 40' C followed by 2 to 3 hours at 28° C prior 
to the shift to 45° C, (c) a 10 minute heat shock at 
45'' C followed by about 2 hours at 28° C prior to the 
shift to 45" 0 and (d) treatment of seedlings with 50 
SO uM arsenlte at 28'' C for 3 hours or more prior to the 
shift to 45° G. During the pretreatment pnor to 
incubation at the potentially lethal temperature, heat 
shock proteins are synthesized and accumulated. 
Also heat shock mRNA and protein syntheses 
55 occur at 45** C, If the plant seedling is preconditioned 
as described above. When the temperature is shifted 
back to physiological levels (e.g.. 28° 0). normal 
transcription and translation are resumed and after 3 
to 4 hours at normal temperature, there is no longer 
60 detectable synthesis of heat shock proteins (J. Key 
et al (1981) Proc. Natl. Acad. Sci. USA 
78*3526-3530; M. Schlesinger et al. (1982) Trends 
Bfochem. Scl. 1:222-225). The heat shock proteins 
that were synthesized during the 40° C heat shock 
65 treatment are very stable and are not Immediately 
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degraded. 

Although ubiqultin Is regulated in response to 
environmental stress, including heat shock, the 
regulation of ubiquitin transcription differs from that 
of classical heat shock protein transcripts. Both 
ubiquitin and heat shock protein mRNA levels are 
elevated in response to cellular stress. However, 
whereas classical heat shock proteins accumulate 
during heat shock and persist during the recovery 
phase, ubiquitin mRNAs accumulated during heat 
shock are rapidly degraded within hours after stress 
treatment. This unstable mRNA transcript suggests 
a specialized but transient role for ubiquitin during 
heat shock, and Implicates a unique DNA sequence 
In the ubiquitin gene promoter region, specifying 
specialized regulatory control during cellular re- 
sponse to stress. 

SUMMARY OF THE INVENTION 

The primary object of this invention Is to provide 
novel DNA segments and constructions comprising 
a regulatory promoter system which will enable 
those skilled in the art to selectively express 
structural genes in plant tissue. The promoter 
comprises the DNA sequences from the 5' nontran- 
scribed regions of plant ubiquitin genes that Initiate 
and regulate the transcription of genes placed under 
its control In Its preferred embodiment, the pro- 
moter sequence Is derived from the upstream region 
of the ubiquitin gene from maize. 

The Isolation and characterization of a promoter 
which is active In plants to control and regulate the 
expression of a downstream gene Is described in the 
present work. This DNA sequence is found as a 
naturally occurring region upstream of the ubiquitin 
structural gene isolated from a maize genomic 
library. The transcription start site or cap site as 
determined by S1 nuclease mapping is designated 
as base 1 and the sequences embodied within about 
899 bases 5' of the transcription start site plus about 
1093 bases 3' of the cap site but 5' of the translation 
start site constitute the ubiquitin promoter. Located 
within this approximately 2kb promoter region are a 
TATA box (-30), two overtapping heat shock consen- 
sus elements (-204 and -214). an 83 nucleotide 
leader sequence immediately adjacent to the tran- 
scription start site and an Intron extending from 
base 64 to base 1093. 

A further object of this Invention Is to provide a 
recombinant DNA molecule comprising a plant 
expressible promoter and a plant expressible struc- 
tural gone, wherein the structural gene Is placed 
under the regulatory control of all transcription 
initiating and activating elements of the promoter. In 
particular, the plant ubiquitin promoter can be 
combined with a variety of DNA sequences, typically 
structural genes, to provide DNA constructions for 
regulated transcription and translation of said DNA 
sequences and which will allow for regulated control 
of expression when stressed wltti elevated tempera- 
tures. 

Such recombinant DNA molecules are Introduced 
Into plant tissue so that the promoter/structural 
gene combination Is expressed, it Is contemplated 
that the method of the present Invention is generally 



applicable to the expression of structural genes In 
both monocotyledonous and dicotyledonous plants. 

BRIEF DESCRIPTION OF THE FIGURES 

5 

Figure 1 is an analysis of a maize ubiquitin 
genomic clone. (A) Restriction map of ubiquitin 
gene, 7.2b1. (B) Restriction map of two sub- 
cloned PstI fragments of ubiquitin gene 1. (C) 

10 Schematic representation of maize ubiquitin 

gene 1 organization. The 5' untranslated exon is 
Indicated by the open box and the tandem 
ubiquitin coding regions are indicated by the 
numbered boxes. 

/5 Figure 2 documents the DNA sequence and 

the deduced amino acid sequence of ubiquitin 
gene 1 . The start of transcription as determined 
by SI nuclease mapping is denoted as base 1. 
Sequences representing the putative "TATA" 

20 box (-30) and the overiapplng heat shock 

consensus sequences (-214 and -204) are 
underiined. The Intron extends from base 84 to 
base 1093 and the polyubiquitin protein coding 
sequence extends from base 1094 to 2693. 

2$ Figure 3 demonstrates that all seven of the 

ubiquitin coding repeats encode an identical 
amino acid sequence. The nucleotide sequence 
of the seven repeats is shown aligned under the 
derived amino acid sequence. An additional 

30 77th amino acid, glutamine, Is present in the 7th 

repeat preceding the stop codon. A polyadeny- 
lation signal, AATAAT, is present in the 3' 
untranslated region, 113bp from the stop 
codon. 

35 Figure 4 is a diagrammatic presentation of 

the procedure used for the construction of the 
maize ubiquitin promoter region-chloramphenl- 
col acetyl transferase (CAT) gene fusion. 

Figure 5 presents an assay for the ubiquitin 

40 promoter. CaMV-CAT. cauliflower mosaic virus 

35S promoter -CAT gene fusion; UBQ-CAT. 
maize ubiquitin promoter - CAT gene fusion. 

DETAILED DESCRIPTION OF THE INVENTION 

45 The following definitions are provided In order to 
remove ambiguities as to the intent or scope of their 
usage In the specltication and claims. 

Expression refers to the transcription and/or 
translation of a structural gene. 

SO Promoter refers to the nucleotide sequences at 
the 5' end of a structural gene which direct the 
initiation of transcription. Promoter sequences are 
necessary, but not always sufficient, to drive the 
expression of a downstream gene. In general, 

55 eukaryotic promoters include a characteristic DNA 
sequence homologous to the consensus 5'-TA- 
TAAT-3' (TATA) box about 10-30 bp 5' to the 
transcription start (cap) site, which, by convention, 
is numbered -1-1. Bases 3' to the cap site are given 

60 positive numbers, whereas bases 5' to the cap site 
receive negative numbers, reflecting their distance 
from the cap site. Another promoter component, the 
CAAT box. Is often found about 30 to 70 bp 5' to the 
TATA box and has homology to the canonical form 

65 5'-CCAAT-3' (R. Breathnach and P. Chambon (1981) 
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Ann. Rev. Biochem. 50:349-383). In plants the CAAT 
box Is sometimes replaced by a sequence known as 
the AGGA box, a region having adenine residues 
symmetrically flanking the triplet G(orT)NG (J. 
Messing et al. (1983). In Genetic Engi neering of 
Plants . T. Kosuge et al. (eds.). Plenum Press, 
pp. 211-227). Other sequences conferring regulatory 
influences on transcription can be found within the 
promoter region and extending as far as 1000 bp or 
more from the cap site. 

Regulatory Control refers to the modulation of 
gene expression Induced by DMA sequence ele- 
ments located primarily, but not exclusively, up- 
stream of (5' to) the transcription start site. 
Regulation may result in an ail-or-nothing response 
to environmental stimuli, or It may result in variations 
in the level of gene expression. In this invention, the 
heat shock regulatory elements function to enhance 
transiently the level of downstream gene expression 
in response to sudden temperature elevation. 

Placing a structural gene under the regulatory 
control of a promoter or a regulatory element means 
positioning the structural gene such that the 
expression of the gene Is controlled by these 
sequences. In general, promoters are found posi- 
tioned 5' (upstream) to the genes that they control. 
Thus, in the construction of heterologous promoter/ 
structural gene combinations, the promoter Is 
preferably positioned upstream to the gene and at a 
distance from the transcription start site that 
approximates the distance between the promoter 
and the gene it controls in its natural setting. As Is 
known in the art, some variation in this distance can 
be tolerated without loss of the promoter function. 
Similarly, the preferred positioning of a regulatory 
element with respect to a heterologous gene placed 
under Is control reflects its natural position relative 
to the structural gene it naturally regulates. Again, as 
Is known In the art. some variation in this distance 
can be accommodated. 

Promoter function during expression of a structu- 
ral gene under Its regulatory control can be tested at 
the transcriptional stage using DNA-RNA hybndiza- 
tion assays ("Northern" blots) and at the transla- 
tional stage' using specific functional assays for the 
protein synthesized (for example, by enzymatic 
activity or by immunoassay of the protein). 

Structural gene Is that portion of a gene compris- 
fng DNA segment encoding a protein, polypeptide or 
a portion thereof, and excluding the 5' sequence 
which drives the initiation of transcription. The 
structural gene may be one which is normally found 
in the cell or one which is not normally found In the 
ceflular location wherein It is introduced, in which 
case it Is termed a heterologous gene . A heterolo- 
gous gene may be derived In whole or In part from 
any source known to the art. Including a bacterial 
genome or episome. eukaryotlc, nuclear or plasmid 
DNA, cDNA, viral DNA or chemically synthesized 
DNa! a structural gene may contain one or more 
modifications in either the coding or the untrans- 
lated regions which could affect the biological 
activity or the chemical structure of the expression 
product, the rate of expression or the manner of 
expression control. Such modifications include, but 



are not limited to, mutations. Insertions, deletions 
and substitutions of one or more nucleotides. The 
structural gene may constitute an uninterrupted 
coding sequence or it may include one or more 

5 introns, bound by the appropriate splice junctions. 
The structural gene may be a composite of seg- 
ments derived from a plurality of sources, naturally 
occurring or synthetic. The structural gene may also 
encode a fusion protein. It Is contemplated that the 

10 introduction Into plant tissue of recombinant DNA 
molecules containing the promoter/structural gene/ 
polyadenylatlon signal complex will Include con- 
structions wherein the structural gene and Its 
promoter are each derived from different plant 

75 species. . 

Pl ant Ubiquitin Regulatory System refers to the 
approximately 1 kb nucleotide sequence 5' to the 
translation start site of the maize ublqultln gene and 
comprises sequences that direct initiation of fran- 
co scriptlon. regulation of transcription, control of 
expression level, induction of stress genes and 
enhancement of expression In response to stress. 
The regulatory system, comprising both promoter 
and regulatory functions, is the DNA sequence 
25 providing regulatory control or modulation of gene 
expression. A structural gene placed under the 
regulatory control of the plant ublqultln regulatory 
system means that a structural gene Is positioned 
such that the regulated expression of the gene Is 
30 controlled by the sequences comprising the ubl- 
qultln regulatory system. . 

Polvade nvlatlon signal refers to any nucleic acid 
sequence capable of effecting mRNA processing, 
usually characterized by the addition of polyadenyllc 
35 acid tracts to the 3'-ends of the mRNA precursors 
The polyadenylatlon signal DNA segment may itself 
be a composite of segments derived from several 
sources, naturally occurring or synthetic, and may 
be from a genomic DNA or an mRNA-derlved cDNA. 
40 Polyadenylatlon signals are commonly recognized 
by the presence of homology to the canonical form 
5'-AATAA-3', although variation of distance, partial 
"readthrough," and multiple tandem canonical se- 
quences are not uncommon (J. Messing et ai. 
46 supra). It should be recognized that a canonical 
^^E5i^denylatlon signal" may in fact cause transcrip- 
tional termination and not polyadenylatlon perse (C. 
Montell et al. (1983) Nature 305:600-605). 

Plant tissue includes differentiated and undif- 
ferentiated tissues of plants, Including, but not 
limited to roots, shoots, leaves, pollen, seeds, tumor 
tissue and various forms of cells In culture, such as 
single cells, protoplasts, embryos and callus tissue. 
The plant tissue may be In plants or In organ, tissue 

SS or cell culture. -^^^y.u, «r 

Homology, as used herein, refers to identity or 
near identity of nucleotide and/or ammo acid 
sequences. As Is understood in the art, nucleotide 
mismatches can occur at the third or wobble base In 
60 the codon without causing amino acid substitutions 
in the final polypeptide sequence. Also, minor 
nucleotide modifications (e.g., substitutions, Inser- 
tions or deletions) in certain regions of the gene 
sequence can be tolerated and considered insignifi- 
es cant whenever such modifications result In changes 
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In amino acid sequence thai do not alter the 
functionality of the final product. It has been shown 
that chemically synthesized copies of whole, or parts 
of, gene sequences can replace the corresponding 
regions In the natural gene without loss of gene 
function. Homologs of specific DNA sequences may 
be Identified by those skilled in the art using the tost 
of cross-hybrldizatlon of nucleic acids under condi- 
tions of stringency as Is well understood In the art 
(as described in Names and Higgens (eds.) (1985) 
Nucleic Acid Hybridisation . IRL Press. Oxford, UK). 
Extent of homology Is often measured In terms of 
percentage of identity between the sequences 
compared. Thus, In this disclosure it will be 
understood that minor sequence variation can exist 
within homologous sequences. 

Derived from Is used herein to mean taken, 
obtained, received, traced, replicated or descended 
from a source (chemical and/or biological). A 
derivative may be produced by chemical or biologi- 
cal manipulation (Including but not limited to sub- 
stitution, addition, insertion, deletion, extraction. 
Isolation, mutation and replication) of the original 
source. 

Chemically synthesized , as related to a sequence 
of DNA, means that the component nucleotides 
were assembled In vitro . Manual chemical synthesis 
of DNA may be accomplished using well established 
procedures (M. Caruthers (1983) In Methodology of 
DN A and RNA Sequencing . Welssman (ed.). Praeger 
Publishers (New York) Chapter 1), or automated 
chemical synthesis can be performed using one of a 
number of commercially available machines. 

Heat shock elements refer to DNA sequences that 
regulate gene expression in response to the stress 
of sudden temperature elevations. The response is 
seen as an immediate albeit transitory enhancement 
In level of expression of a downstream gene. The 
original work on heat shock genes was done with 
Drosophila but many other species including plants 
(T. Barnett at al. (1980) Dev. Genet. 1:331-340) 
exhibited analogous responses to stress. The 
essential primary component of the heat shock 
elements was described in Drosophila to have the 
consensus sequence 5'-CTGGAAT-TTCTAGA-3' 
and to be located In the region between residues -66 
through -47 bp upstream to the transcriptional start 
site (H. Pelham and M. Bienz (1982) supra ). A 
chemically synthesized oligonucleotide copy of this 
consensus sequence can replace the natural se- 
quence in conferring heat shock inducibility. In other 
systems, multiple heat shock elements were identi- 
fied within the promoter region. For example, 
Rochester et al. (1986) supra recognized two heat 
shock elements in the maize hsp 70 gene. 

Leader sequence refers to a DNA sequence 
comprising about 100 nucleotides located between 
the transcription start site and the translation start 
site. Embodied within the leader sequence is a 
region that specifies the ribosome binding site. 

Introns or Intervening sequences refer In this work 
to those regions of DNA sequence that are tran- 
scribed along with the coding sequences (exons) 
but are then removed In the formation of the mature 
mRNA. Introns may occur anywhere within a tran- 



scribed sequence - between coding sequences of 
the same or different genes, within the coding 
sequence of a gene, Inten-upting and splitting its 
amino acid sequences, and within the promoter 

5 region (5' to the translation start site). Introns In the 
primary transcript are excised and the coding 
sequences are simultaneously and precisely llgated 
to form the mature mRNA. The junctions of introns 
and exons form the splice sites. The base sequence 

10 of an Intron begins with GU and ends with AG. The 
same splicing signal is found In many higher 
eukaryotes. 

The present invention relates to the development 
of a recombinant vector useful for the expression of 
15 DNA coding segments in plant cells. The vector 
herein described emptoys a maize ubiquitin pro- 
moter to control expression of an inserted DNA 
coding segment. The transcriptional regulatory se- 
quences may be combined with an extrachromoso- 
20 mai replication system for a predetermined host. 
Other DNA sequences having restriction sites for 
gene Insertion may be added to provide a vector for 
the regulated transcription and translation of the 
inserted genes In said host. The vector may also 
255 include a prokaryotlc replication system allowing 
amplification in a prokaryotlc host, markers for 
selection and other DNA regions. This would allow 
large quantities of the vector to be grown in well 
characterized bacterial systems prior to transfor- 
30 ming a plant or mammalian host. The principles for 
construction of a vector having proper orientation of 
the promoter and coding sequences with respect to 
each other are matters well-known to those skilled in 
the art. In some situations it may be desirable to Join 
35 the promoter system to a desired structural gene 
and to Introduce the resultant construct DNA 
directly into a host Methods for such direct 
transfers include, but are not limited to, protoplast 
transformation, eiectroporation, direct injection of 
40 DNA Into nuclei and co-transformation by cateium 
precipitation. 

This invention comprises the first report of an 
Isolated and characterized plant ubiquitin promoter. 
The maize ubiquitin promoter as described in the 
4S present work includes the RNA polymerase recogni- 
tion and binding sites, the transcriptional initiation 
sequence (cap site), regulatory sequences respon- 
sible for inducible transcription and an untranslat- 
able intervening sequence (Intron) between the 
50 transcriptional start site and the translattonal initia- 
tion site. Two overtapping heat shock consensus 
promoter sequences are situated 5' (-214 and -204) 
of the transcriptional start site. An exon of 83 
nucleotides is located Immediately adjacent to the 
55 cap site and is followed by a large (approximately 
Ikb) intron. 

The ubiquitin promoter along with the ubiquitin 
structural gene can be Isolated on two approxi- 
mately 2kb Pst fragments of the maize genome 

60 (Figure 1 ). The entire fragment can be used to show 
promoter function by monitoring expression of 
mRNA or protein. Introduction of a heterologous 
gene downstream of the ubiquitin translation initia- 
tion codon will result in the expression of a fused 

65 protein. Insertion of a heterologous gene (having Its 
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own start and stop codons) between the ublquttln 
promoter and translation Initiation codon will result 
In the expression of the native polypeptide corre- 
sponding to the inserted gene. The Insertion of the 
desired structural gene Is most conveniently accom- 
plished with the use of blunt-ended linkers at the 
ends of the gene. 

Alternatively, the ubiqultin gene fragment may be 
restricted, particularly at a site immediately preced- 
ing the start of the structural gene or at a site 
preceding the transcription start site. For example. 
In the present invention the promoter fragment was 
derived from the ubiqultin gene as an approximately 
2kb PstI fragment. To ensure that the promoter 
fragrnent is devoid of the translational initiation 
codon, the fragment containing the 5' flanking region 
may be selectively digested with double stranded 
exonuclease under controlled conditions to remove 
a desired number of nucleotide pairs. It is desirable 
to remove the ubiqultin translation initiation codon 
so that translation of the Inserted gene will com- 
mence at its own start site. The isolated (and 
shortened) promoter fragment may then be inserted 
Into the vector using linkers or homopolymer tailing 
to Introduce desired restriction sites compatible 
with the remaining regions of the vector. In general, 
the promoter fragment may be cleaved with specific 
restriction enzymes and the resultant shortened 
DNA fragments tested for promoter function and 
compared to that of the Intact promoter. In addition, 
DNA codons may be added and/or existing sequen- 
ces may be modified to give derivative DMA 
fragments retaining promoter functions. 

The resulting DNA constructs may be useful as 
cloning vehicles for a structural gene of Interest in a 
plant host. In this Invention, the structural gene 
encoding CAT under control of either the maize 
ubiqultin promoter or the cauliflower mosaic virus 
promoter was expressed In both oat and tobacco 
ceils. When the ubiqultin promoter was employed, a 
greater degree of expression was obtained with the 
monocot host than with the dicot host; however, a 
higher level of expression was obtained with dicot 
than with monocot host when the cauliflower mosaic 
virus promoter was utilized. The differential in 
expression levels reflects the Inherent inequality of 
different promoters as well as basic cellular differen- 
ces in regulation of expression and processing 
between monocots and dicots. To date, it is not 
predictable, routine or obvious that a monocot 
promoter will operate In a dicot host cell. 

A wide variety of structural genes may be 
Introduced into the subject DNA cloning vectors for 
the production of desired proteins, such as 
enzymes, hormones and the like, in addition, DNA 
constructs of this type can be used for the enhanced 
production of DNA derived form a particular gene, as 
well as for enhanced production of mRN A which can 
be used to produce cDNA. Such vectors carrying 
specific DNA sequences find wide application and 
are quite versatile; for example, they can be used for 
amplification in bacteria as well as for expression in 
higher cells which allow for additional cellular 
functions. An advantage of utilizing higher euka- 
ryotic recombinant systems to produce commer- 



cially medical and agriculturally desirable proteins is 
that they ensure correct post-translational modifica- 
tions which may otherwise be difficulty to duplicate 
In prokaryotic and lower eukaryotic hosts. 

5 In this Invention the maize ubiqultin promoter was 
shown to function in oat and tobacco, as examples 
of monocots and dicots, respectively, and it is 
conceivable that this promoter can function in yet 
other cells. Such systems include, by way of 

10 example, and without limitation, other cells from 
which ubiqultin genes have bene isolated and found 
to be highly conserved, for example, other monocots 
in addition to maize, dicots other than tobacco 
lower eukaryotic organisms such as yeast and 

15 mammalian cells. The screening of cellular systems 
suitable for use with the maize ubiqultin promoter 
can be accomplished according to the teaching 
herein, without undue experimentation. The con- 
struction of vectors suitable for the expression of a 

20 DNA coding segment in Individual systems has been 
well documented. Shuttle vectors capable of replica- 
tion in more than one host have also been described, 
for example, shuttle expression vectors for both 
yeast and mammalian cells, for plants and animal 

26 cells and for plants and bacterial cells. In addition. It 
will be understood that ubiqultin genes from any 
other system, that are similar to the maize ubiquitin 
gene in functioning as a plant promoter, may be 
employed as the source for the ubiquitin promoter 

30 sequence. 

The present invention also relates to the utilization 
of the maize ubiquitin promoter as a heat shock 
promoter. Two heat shock consensus sequences 
are located upstream of the maize ubiquitin gene at 
35 positions -214 and -204. In many eukaryotes, 
naturally occurring and chemically-synthesized se- 
quences homologous to the heat shock consensus 
sequence have been shown to regulate the induc- 
tion of gene expression. Although the ubiquitin 
40 promoter contains sequences that are identified as 
being those of heat shock elements, the promoter is 
distinguished from classical heat shock promoters 
(1) In having a nontranslated Intron 3 to the 
transcription start site and (2) In regulating ubiquitin 
45 expression constitutlvely as well as inductive^. The 
functional relationship between heat shock ete- 
ments and the presence of a large Intron within the 
promoter region is unknown to prior art. The 
nucleotide distance between these charactenstic 
50 features and also the directlonalily and orientation of 
one element with respect to the other are presumed 
In the present work to be variable, as long as the 
basic promoter function of the derivative regulatory 
fragments remains active. 
55 The presence of an Intron In the promoter region 
has been related to the relative stability of the 
unprocessed mRN A transcript and, indirectly, to the 
level of protein synthesized (Caliis et al. (1987) 
Genes and Development 1:1183-1200), Constitu- 
60 lively expressed ubiqultin mRNA has been reported 
to be maintained at stable levels In chicken embryo 
fibroplasts. whereas ubiquitin mRNA formed in 
response to stress has a half-life of approximately 

65 ^in°yeast four distinct ublqultln-coding loci have 
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been described. Constitutively expressed ubiqultin 
Is encoded by one or more of three of the ubiqultin 
genes, two of which contain an approximately 400 bp 
Intron immediately within the coding region. The 
fourth ubiqultin gene, devoid of a nontranslated 
Intron but comprising multiple heat shock elements, 
functions primarily In Inducing ubiqultin expression 
In response to stress. It has been shown that the 
latter ubiqultin gene does not act constitutively but 
rather is turned on In response to heat shock or 
stress signal (E. Ozkaynak et al. (1987) EMBO J. 
6:1429-1439). 

" In maize, ubiqultin Is encoded by a small multlgene 
family. In this Invention is presented the nucleotide 
sequence of one of the ubiqultin genes. A large 
{approximately 1 kb) Intron between the transcrip- 
tional and the translational start sites as well as 
nucleotide sequences corresponding to consensus 
heat shock sequences are found within the maize 
ubiquitin promoter region. These two regions of 
specialization most probably are Involved In ubiqultin 
synthesis and In regulating the ubiqultin level in 
response to external Influences. The functional 
relationship between the Intron and the heat shock 
elements encompassed within the ubiqultin pro- 
moter Is unknown. It is reported In this Invention that 
the maize ubiqultin promoter regulates the synthesis 
of mRNA both under normal and under heat shock 
conditions and that changes in the regulation of 
transcription account for the enhancement In ubi- 
qultin synthesis after heat shock. 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXAMPLES 

Example 1 : Isolation and Characterization of the 
Maize Ubiquitin Gene 

A. Growth of Plants 

Zea mays Inbred line B73 was grown in moist 
vermicullte for 4 to 5 days at 25'* C in the dark. The 
portion of the seedlings from the mesocotyl node to 
the shoot tip was harvested, frozen in liquid nitrogen 
and stored at -80° C. 

B. RNA Isolation and Analysis 

Total cellular RNA was extracted from frozen 
tissue using the guanidine thiocyanate procedure. 
Poly(A) + RNA was isolated from total cellular RNA 
by passage over a poly U-Sephadex (Bethesda 
Research Laboratories. Gaithersburg, MD) column. 
Total or poIy(A)+ RNA was electrophoresed In 
1.50A) agarose gels containing 30/o (wt/vol) formalde- 
hyde. RNA was transferred to Gene Screen^^ 
(DuPont) by capillary blotting using 25 ml^ sodium 
phosphate (pH6.6). 

Blots were prehybridized In 500/o formamide, 
5XSSC, lOOvig denatured salmon DNA, 40mM 
sodium phosphate (pH6.8). O.StVb BSA and 1o/o SDS. 
Blots were- hybridized In 50o/o formamide. 5XSSC, 
lOOjig/ml denatured salmon DNA. 40mM sodium 
phosphate 9pH6.8) and IQo/o dextran sulfate. 



C. cDNA Library Construction 

Double stranded cDNA was synthesized from 
poly(A) + RNA by a modification of the method of 
Murray et al. (1983) Plant Mol. Biol. 2:75-84. 

5 OIIgo(dG)-taIiing of the double-stranded cDNA and 
annealing of oligo(dC)-tailed cDNA with 
oligo(dG)-talled pBR322 were performed using stan- 
dard technology. The annealed DNA was trans- 
formed Into E. coil HB101 and plated directly onto 

10 nitrocellulose filters (Millipore, HATF; 0.45 \Lm) on 
L-agar plates containing tetracycline (16 ^ig/ml). 

D. Identifications of Ubiquitin cDNA 

A number of cDNAs representing potentially 

fS light-regulated mRNAs were obtained by screening 
a cDNA library by differential hybridization. Several 
of these cDNAS were selected and further screened 
by RNA blot analysis to confirm light regulation. One 
cDNA clone, p6R7.2b1, while not representing a 

20 red-light regulated mRNA. was of Interest because it 
hybridized with three poly(A)-h RNAs of different 
size and abundance. Nick translated p6T7.2b1 
hybridized strongly with the 2100 nucleotide and 
1600 nucleotide mRNAs, but only weakly with the 

25 800 nucleotide transcript. However, hybridization of 
Northern blots with a single stranded 32p-iabeled 
RNA generated by SP6 polymerase transcription of 
linearized pGA210, a plasmid constructed by sub- 
cloning the cDNA insert of p6T7.2b1 into pSP64, 

30 readily detected all three transcripts. 

Since RNA-RNA hybrids are known to be more 
thermally stable than DNA-RNA hybrids, single 
stranded RNA probes rather than nick translated 
DNA probes were used in Northern blot hiybridiza- 

35 ■ tions. Again, the 1600 base transcript was found to 
be about 3 fold less abundant than the 2100 base 
transcript as determined from Northern blots, re- 
gardless of whether the blot was hybridized with nick 
translated DNA or single strand RNA probes. The 

40 smallest transcript was about half as abundant as 
the 2100 base mRNA in blots hybridized with RNA 
probes. 

Restriction fragments were subcloned into 
M13mp18 and/or mp19 and sequenced by the 

45 dideoxynucleotide chain termination method. Ana- 
lysis of the sequence of the clone revealed a single 
long open reading frame of 818 bp terminating in a 
TAA stop codon. The National Biomedical Research 
Foundation library was searched using the D fast P 

SO program for protein sequences homologous with the 
deduced amino acid sequence. Greater than 95o/o 
homology was found between the deduced amino 
acid sequence of the maize cDNA clone and the 
sequences of bovine and human ubiquitin. 

E. Genomic Ubraiy Construction and Screening 

High molecular weight maize DNA was Isolated 
from frozen maize seedling. DNA was partially 
digested with Sau3A. size fractionated and cloned 

$0 Into the Bam HI sites of Charon 35 (Loenen et al. 
(1983) Gene28:171-179). A library of about 2X106pfu 
was screened for recombinant phage containing 
sequences homologous to the ubiqultin cDNA clone 
by in situ plaque hybridization using a ubiquitin 

65 cDNA clone as a hybridization probe. Recombinant 
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phage were purified from broth lysates and phage 
DNA was isolated using standard techniques. Re- 
striction endonuclease digestions were carried out 
according to manufacturers' specifications. 

F. Qenomic Southern Blot Analysis 

Isolated, high molecular weight maize DNA was 
digested with EcoRI. HIndlll and Sad , fractionated 
on 0.70/0 agarose gels and the DNA fragments were 
transferred to Gene Screen Plus™ (DuPont). Filters 
were prehybridized for 6-8h at 65" C in 6XSSC 
(1XSSC = 0.15M NaCI. 0.025M Na Citrate), 5X 
Denhardfs medium, 100 jig/ml denatured, soni- 
cated Salmon DNA, 20 ng/ml polyadenylic acid. 10 
mM disodlum EDTA and O.50/0 SDS. Filters were 
hybridized at 65" C in fresh buffer with ^^p labeled 
plasmid DNA (pCA210). Autoradiography was car- 
ried out at -ao^'C using Kodak X-OMAT AR Film and 
one DuPont Cronex LightningPlus Intensifying 
screen. In each digest, 8 to 10 restriction fragments 
hybridized with the nick translated pCA210 probe, 
suggesting that ubiqultin is coded by a small 
multigene family. Evidence that ubiquitin is encoded 
by a small multigene family has also been reported 
for Xenopus, barley and yeast. 

Two or three fragments in each digest hybridized 
strongly with the probe, whereas the remainder of 
the fragments hybridized wealdy. The differences in 
hybridization intensities may reflect different se- 
quence homology such that the cDNA probe 
hybridizes preferentially to the gene from which it 

was derived. 

Ubiqultin genes from yeast and Xenopus have 
been characterized and have six and at least twelve 
ubiqultin repeats, respectively. Maize genes corre- 
sponding to the three transcripts detected on 
Northern blots may have seven, five and one or two 
ubiqultin repeats In the 2.1, 1.6 and 0.8 kb mRNAs. 
respectively. The maize ubiqultin gene described in 
this invention codes for seven repeats. Thus, the 
difference In hybridization Intensity observed on 
Southern blots may be a result of the restriction 
fragments containing a different number of ubiqultin 
repeats. 

The ubiquitin cDNA clone did not contain EcoRi 
and HIndlll sites. However, the maize ubiqultin genes 
may contain introns which are cut by the restriction 
endonucleases used In the genomic digests. This 
could result in ubiquitin exons being on different 
fragments and could account for the differential 
hybridization Intensities observed In the Southern 
blots. 

G. Ubiquitin Sequence Analysis and Transcription 
Start Site Analysis 

Dideoxynucleotide chain termination sequencing 
was performed using Klenow fragments of DNA 
polymerase 1 (Boehringer Mannheim). A 1 .85 kb^^sti 
fragment of the genomic clone 7.2bl (see Figure 1b) 
homologous to the cDNA clone p6T7.2b.1 and the 2 
kb PstI fragment Immediately upstream, termed 
AC3#9M13RF, were subcloned In both orientations 
Into M13mp19. Recombinant phage RF DNA was 
prepared as for plasmid DNA, Unidirectional pro- 
gressive deletion clones for sequencing both 



strands of these PstI fragments were prepared. 
Exonuclease III and Exonuclease VII were obtained 
from New England Blolabs and Bethesda Research 
Laboratories, respectively. Computer analysis of 
5 DNA sequences was performed using programs 
made available by the University of Wisconsin 
Genetics Computer Group. 

The transcription start site of the ubiquitin gene 
and the 3' Junction of the Intron and exon in the 5' 
10 untranslated region of the gene were determined by 
SI nuclease mapping. Fragments suitable for SI 
probes were prepared as follows. The ubiquitin DNA 
was digested with either Bglll or Xhol. These were 
then labeled with 32p using -^^P ATP (6000 Ci/mmol. 
IS New England Nuclear, Boston. MA) and T4 polynu- 
cleotide kinase (New ENgland Blolabs). Subsequent 
digestion of the Bgll and Xho l kinased fragments 
with PstI and EcoRI, respectively, generated a 946 
bp Psti-Bgin fragment and a 643 bp EcoRI-Xhol 
50 fragrneht. These fragments were separated from the 
other end-labeled fragments by electrophoresis 
through a 50/o polyacrylamlde gel. Slices containing 
the 946 bp Pstl-B^ll and the 643 pb EcoRI-Xhol 
fragments were cut out of the gel and the labeled 
25 DNAs were eluted from the gel. End-labeled DNA 
fragment (10-20 tmole) was hybridized with 2 \ig of 
poly(A) + RNA in 30 p,l of buffer containing 8O0/0 
deionized formamide, 0.4M sodium chloride. 40mM 
PIPES 9pH6.4) and ImM EDTA (pHB.O). The nucleic 
30 acid solution was heated to 80° C for about 16h. 
Ice-cold SI digestion buffer (SOO^il) containing 
280mM sodium chloride. 50mM sodium acetate 
(pH4.6), 4.6mM zinc sulfate and 20 |Ag/ml single 
stranded DNA was added and the DNA digested 
35 with 250 units/ml of SI nuclease (New England 
Nuclear). The reaction was topped with 75^ll of S1 
termination mix containing 2.5M ammonium acetate 
and 50mM EDTA. The products of the S1 nuclease 
digestion were then separated on a 6O/0 polyacryla- 
40 mide/8M urea gel and visualized by autoradiography. 
The end points of the SI protected fragments in the 
ubiqultin sequence were determined by comparison 
with a sequence ladder generated by Maxam/Gllbert 
base modification-cleavage reactions carried out on 
45 the end labeled fragments used as S1 probes. 

The DNA sequence of the maize ubiquitin-1 gene. 
7.2b1. Is shown in Figure 2. The sequence is 
composed of 899 bases upstream of the transcrip- 
tion start site, 1992 bases of 6' untranslated and 
SO intron sequences, and 1999 bases encoding severj 
ubiquitin protein repeats preceding 249 bases of 3 
sequence. A TATA" box Is located at -30 and two 
overiapping heat shock elements are located at -214 
and -240. The DNA sequence of the coding and 3 
55 regions of the ublqultin-1 gene from maize, 7.2b1 , is 
also presented if Figure 3. The derived amino acid 
sequence of maize ubiquitin is shown at the top and 
the nucleotide sequence of the seven ubiqultin 
repeats Is aligned underneath. A schematic of the 
60 organization of the seven complete ubiqultin units in 
the genomic DNA Is shown In Figure 1C. 

The derived amino acid sequences of all of the 
ubiquitin repeats are identical (Rgure 3). The 
terminal (seventh) ubiquitin repeat contains an 
66 additional 77th amino acid, glutamine. prior to the 



10 



19 



EP 0342 926 A2 



20 



TAA Stop codon. This addrtionai amino acid Is not 
found In mature ubiquitin, and Is apparently removed 
during processing. The 77th amino acid of the final 
repeat in the human gene is valine, while in the two 
chicken genes, It Is tyrosine and asparaglne. Yeast 
and barley also have an extra amino acid, asparaglne 
and lysine, respectively; however, an extra amino 
acid was not found In the Xenopus gene. This extra 
amino acid has been proposed to function as a block 
to conjugation of unprocessed polyublqultin to 
target proteins. A polyadenylatlon signal (AATAAT) 
is present In the 3' untranslated sequence. 113 bp 
from the stop codon. 

All seven repeats encode the identical amino acid 
sequence, whereas the nucleotide sequence of the 
repeats varies by as many as 39 nucleotides. This Is 
similar to what has been reported for the nucleotide 
sequence homologies between ubiquitin genes. 
About 800/0 of the nucleotide mismatches between 
ubiquitin repeats are at the third (wobble) base in the 
codon. Alternate codon usage for leucine (5 co- 
dons), serine (3 codons) and arglnlne (3 codons) 
account for the remaining nucleotide mismatches. 

The amino acid sequence for maize ubiquitin is 
Identical to that determined for two other higher 
plants, oat and barley. The sequence differs from the 
sequence reported for yeast by two amino acids: 
alanine for serine substitutions at positions 28 and 
57. The maize sequence is also slightly different form 
that reported for ubiquitin from all animals; substitu- 
tions by serine for proline at position 19, aspartate 
for glutamate at position 24 and alanine for serine at 
position 57. Thus, based on sequence, there appear 
to be three types of ubiquitin: plant, animal and 
yeast. 

Example 2: Construction of plasmid pUB-CAT 
comprising the maize ubiquitin promoter and a 
structural gene 

A. Promoter isolation and construction of pUB-CAT 

The procedure used for construction of the 
Ubiquitin gene upstream region-chloramphenicol 
acetyl transferase (CAT) gene fusion Is outlined in 
Figure 4. The BamHI-Hlndlil restriction fragment 
containing the CAT gene and the nopaline synthase 
(NOS) 3' untranslated region and polyadenylatlon 
signal of pNOS-CAT (Fromm et al. (1985) Proc. Natl. 
Acad. Sci. 82 :5824-5828) was subcloned into Bam HI 
and Hindlll digested pUCia- This construct was 
termed pUB-CAT. 

An approximately 2.0kb PstI fragment immediately 
upstream of the ubiquitin polyprotein coding region 
of the maize ubiquitin gene7.2bl was subcloned into 
M13mp19. This segment of DNA spans nucleotides 
-899 to 1092 of the maize ubiquitin sequence 
documented In Figure 2. This recombinant DNA was 
termed AC3#9M13RF and contains the ubiquitin 
promoter. 5' untranslated leader sequence and 
about Ikb Intron. labeled UBI-5' In Figure 4. 

The ubiquitin promoter-CAT reporter gene fusion 
was constructed by blunt ending with T4 DNA 
polymerase the 2.0kb PstI fragment of 
AC3#9M13RF and cloning this fragment Into Sma l- 
digested pUC18-CAT. Tills construct was termed 



pUB-CAT. 

B. Introduction of Recombinant DNA Into Oat and 
Tobacco Protoplast 

5 Leaves (2g) of 5- to 6-day old etiolated oat 
seedlings were finely chopped with a razor blade. 
The tissue was rinsed several times with digestion 
medium (3mM MES, pH5.7, 10mM calcium chloride, 
0.5M mannitol and 2 mg/ml arginine) and then 

10 incubated for 4h at room temperature with 20 ml 
digestion medium containing 20/o cellulase. The 
tissue was shaken occasionally to release proto- 
plasts. The material was filtered through a 63 |im 
mesh and centrlfuged 5 min at 50xg. The superna- 

15 tant fluid was removed and the protoplasts were 
washed two times with digestion medium and then 
resuspended In electroporation buffer to give 0.5 ml 
of protoplast suspension per electroporation. The 
electroporation buffer consisted of: lOmM HEPES, 

20 pH7.2, 150mM sodium chloride, 4mM calcium 
chloride and 0.5M mannitol. 

Protoplasts (0.5ml) In electroporation buffer were 
mixed on Ice with 0.5ml of electroporation buffer 
containing 40ng plasmid DNA plus lOOng sonicated 

25 salmon DNA. The protoplasts were electroporated 
on Ice with a 350 volt. 70msec pulse. The protoplasts 
were incubated another 10 min on ice, then diluted 
into 10ml Murasige-Skoog (MS) medium and incu- 
bated at room temperature for 24h. 

30 Protoplasts were pelleted by centrifugatlon for 5 
min at 60 xg. The supernatant fluid was removed and 
the protoplasts washed once with MS medium. The 
protoplast pellet was resuspended In 200mJ Buffer A 
{0.25M Tris, pH7.8, ImM EDTA. ImM p-mercap- 

35 toethanol) and transferred to a microcentrifuge tube. 
Protoplasts were disrupted by sonlcatlon for 5-10 
sec at the lowest setting. Protoplast debris was 
pelleted by centrifugatlon for 5 min at 4*0. The 
supernatant fluid was removed, heated to 65° C for 

40 10 min and stored at -20° C. 

0. Assay for CAT activity In transformed protoplasts 
Allquots (IOOp.1) of the electroporated protoplast 
extract (extract of cells transformed with recombl- 
45 nant DNA) were added to 80^1 of Buffer A and 20^1 of 
a mix of 20^1 i^^C-chloramphenlcoi (40-60 mCi/mM). 
2mg acetyl CoA and 230iil Buffer A. The reaction was 
incubated for 90 min at 37** C. The reaction products 
were extracted with 600jil ethyl acetate and were 
50 concentrated by evaporating the ethyl acetate and 
resuspending in 10pJ ethyl acetate. The reaction 
products were separated by thin layer chromato- 
graphy using chloroformrmethanol (95:5,v/v) solvent 
and were detected by autoradiography. 
55 Transformation of host cells was determined by 
measuring the amount of enzymatic activity ex- 
pressed by the structural gene contained within the 
promoter-gene fusion construct. In this example, the 
structural gene encoding chloramphenicol acetyl 
60 transferase was employed In the DNA construct. To 
test the efficacy of the promoter utilized in the 
recombinant DNA fusion construct, parallel electro- 
poratlons were carried out, utilizing either the maize 
ubiquitin promoter-CAT gene fusion pUC-CAT (de- 
65 scribed herein and In Rgure 4) and pCaMV-CAT, a 
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cauliflower mosaic virus 35S promoter-CAT gene 
fusion (Fromm et al. (1985) Proc. Nail. Acad. Sci. 
USA 82:5824-5828) obtained from V. Walbot, Stan- 
ford University. As illustrated in Figure 5, in oat 
protoplasts the ubiqultin promoter is "stronger" than 
the CaMV promoter, as judged by the amount of 
enzymatic activity expressed. 

Example 3: Heat Shock Response 

A. Heat Shock Treatment 

To heat shock. 4- to 5-day old etiolated seedlings 
were transferred to an Incubator at 42° C and 
harvested 1 . 3 and 8h after transfer. Total RNA {7\iQ) 
was Isolated, denatured and electrophoresed 
through a I.50/0 agarose 30/o formaldehyde gel. The 
RNA was transfen-ed to Gen© Screen and probed 
with single stranded RNA transcribed from li- 
nearized pCA210 using SP6 RNA polymerase. (The 
recombinant plasmld. pCA210. was constructed by 
subcloning the 975 bp insert of p6R7.2b1 into pSP64 
(Promega) so that SP6 RNA polymerase syn- 
thesized an RNA probe specific for hybridization 
with ubiqultin mRNA.) After autoradiography, the 
bands were cut out and the amount pf radioactivity 
bound to the filter was determined by liquid 
scintillation. From analysis of the Northern blots, 
levels of three ubiqultin transcripts were determined. 

One hour after transfer to 42** C, the level of the 
2.1 kb transcript Increased 2.5 to 3 fold. An approxi- 
mately 2 fold increase was observed for the 1,6kb 
transcript, however, no Increase was seen for the 
0.8kb transcript. By three hours after transfer of the 
seedlings to elevated temperature, the levels of the 
two largest ubiqultin transcripts had returned to the 
level observed In unshocked tissue and remained at 
those levels for at least another five hours. The 
transitory nature of ubiquitin during the heat shock 
response In maize may indicate the ubiqultin has a 
specialized role In heat shock and that only brief 
periods of increased levels of ubiquitin are required, 

B. Heat Shock Sequences 

The nucleotide sequence of the maize ubiquitin 
gene Is presented in Rgure 2. Within the promoter 
region are nucleotide sequences homologous to the 
consensus heat shock sequence that has been 
shown to confer stress inducibility when placed 
upstream of heterologous promoters (Pelham 
(1982) supra ). The consensus sequence for the 
Drosophila heat shock element Is 
5'-CTG G AAT__TTCTAGA-3' 

and is generally found approximately 26 bases 
upstream of the transcriptional start site. 

Located within 900 bases 5' to the transcriptional 
. start site of the maize ubiqultin promoter are two 
overiapping heat shock sequences: 
5'-CTGGA CCCCTCTCGA-3' starting at nucleotide 

-214, and , 
5'-CTCGA GAQTrCCGCT-3' starting at nucleotide 
-240. The ubiquitin promoter from chicken embryo 
fibroblasts was also found to contain two overlap- 
ping heat shock consensus promoter sequences : 
5'-CTCGA ATCTTCCAG-3' starting at nucleotide 
-369. and 



5'-CCAGA GCnrCTTrr-S' starting at nucleotide 
-359 The 5' flanking region of the yeast ubiquitin 
gene UB14 (E. Ozkaynak et al. (1987) supra) 
comprises an 18 kb. rotationally symmetric (palind- 
5 romic) sequence. 5'-T TCTAGAACGTTCTAGA A-y. 
365 bases upstream of the translation start site. The 
middle 14 bases (underlined) of this 18 bp sequence 
contain an exact homology to the rotationally 
symmetric consensus *heat shock box' nucleotide 
10 sequence starting at approximately 284 nucleotides 
upstream of the presumed transcription start site. 

The relative position of the heat shbck sequence 
with respect to the transcriptional initiation codon 
and Its ultimate consequence on the magnitude of 
15 the induction response to heat shock or other stress 
remains largely unknown, although it has been 
suggested (U. Bond et al. (1986) supra ) that the 
further a heat shock element is located 5' from the 
transcriptional start site, the smaller is the level of 
20 Induction in response to stress. 

In this Invention It Is assumed that a heat shock 
sequence may be arbitrarily positioned at different 
loci within the ubiquitin promoter and that it may be 
chemically altered in sequence or be replaced with a 
26 synthetic homologous sequence, so long as the 
modified promoter sequence retains ubiquitin pro- 
moter function, which comprises the initiation, 
direction and regulation of transcription under stress 
and non-stress conditions. Biochemical techniques 
30 required to insert and/or delete nucleotides and to 
manipulate the resultant DNA fragments are well- 
known to those skilled In the art of genetic 
redesigning. 

35 Example 4: Presence of Heat Shock S equence(s) 
and a Large Intron Within the U biauitin Promoter 

The ubiquitin promoter from maize is charac- 
terized structurally by the presence of two overiap- 
ping heat shock sequences approximately two 
40 hundred bp upstream of the transcriptional start site 
and that of a large (approximately Ikb) intron 
between the transcriptional start site, and the 
translatlonal initiation codon. This promoter struc- 
ture Is very similar to that reported (U. Bond et al. 
45 (1986) supra) for the ubiquitin promoter from 
chlckenimb?yo fibroblasts In which two overiapping 
heat shock sequences are located approximately 
350 bp upstream of the transcriptional start site and 
a 674 bp intron is contained between the transcrip- 
50 tlonal and transiational initiation codons. Receritly 
(E. Ozkaynak et al. 91987) supra ), the nucleotide 
sequence of the promoter region from yeast ubi- 
quHIn UB14 gene was determined and found to 
contain a heat shock sequence approximately 280 
55 bp upstream of the transcriptional start site, but this 
yeast ubiquitin promoter was devoid of a large Intron 
between the transcription and translation Initiation 
sites. However, two other yeast ubiquitin genes, 
which did contain Introns. were found to be lacking 
60 sequences homologous to the Pelham "heat shock 
box" sequence. 

Ubiquitin promoters have been shown to up-regu- 
late expression of ubiqultin In response to heat 
shock in yeast, chicken embryo fibroblasts and 
65 maize. In all three systems, the level of ubiqultin 
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mRNA Is elevated after heat shock treatment and 
the Increase in ublquitln level was determined In 
maize and chicken embryo fibroblasts to be approxi- 
mately 3 fold. This enhancement In ubiqultin ex- 
pression In response to heat shock is significantly 
less than that obtained with other heat shock genes. 
It was found In chicken embryo fibroblasts that the 
levels of ubiqultin mRNA in cells exposed to 45** C 
Increase by 2.5 fold over a 2.5h period, whereas the 
levels of HSP70 mRNA Increased 10 fold under the 
same heat shock conditions. Moreover, the relative 
Instability of ubiqultin mRNA during recovery of cells 
from a 3h heat shock (half-lif a of approximately 1 .6 to 
2h) was also found to differ significantly from that of 
HSP70 mRNAs which were found to bo stable. 

It is interesting to note that In contrast to ubiqultin 
promoters. HSP70 genes do not contain large 
Introns between the transcriptional and translational 
Initiation codons. Another difference between the 
ubiqultin promoter and other heat shock promoters 
is that ubiqultin is expressed both constitutively and 
Inductively, whereas expression of classlcaJ heat 
shock proteins occurs predominantly In response to 
heat shock or other stress. This Invention allows 
skilled workers knowledgeable In the art to modify 
ubiqultin promoter with respect to the composition 
sequence and position of both the Intron and the 
heat shock sequences in order to alter constitutive 
and/or inductive expression of ublquitln. Also, 
standard recombinant technology may be employed 
to reposition, as well as to chemically alter the 
nucleotide sequences within the maize ubiqultin 
promoter region In such a fashion as to retain or 
improve the promoter function of the resultant 
modified DNA. Testing for ubiquitin promoter func- 
tion may be carried out as taught in Example 2. 



9. A DNA sequence as In claim 1 wherein said 
heat shock element Is located upstream of the 
transcription start site and wherein said intron 
Is located downstream of the transcription start 

6 site. 

10. A DNA construct comprising: 

(a) a DNA sequence no larger than 2 kb, 
said DNA sequence comprising a plant 
ubiquitin regulatory system, wherein said 

IQ regulatory system contains a heat shock 

element and an Intron, and 

(b) a plant-expressible structural gene 
wherein said structural gene is placed 
under the regulatory control of said plant 

IS ubiquitin regulatory system. 

11. A DNA construct as In claim 10 wherein 
said structural gene codes for a gene that Is not 
under heat shock control In nature. 

12. A method for the constitutive expression of 
20 a structural gene and the selected stress-in- 
duced enhancement in expression of said 
structural gene In a plant cell comprising the 
steps of: 

(a) transforming said plant cell with a 
^5 DNA construct comprising a plant ubiquitin 

regulatory system, wherein Is found a heat 
shock element and an Intron, and a 
plant-expressible structural gene that is 
under the regulatory control of said plant 
30 ublquitln regulatory system, and 

(b) selectively applying stress condi- 
tions to said transformed plant cell thereby 
Inducing enhancement in expression of 
said structural gene. 



Claims 

40 

1 . A DNA sequence no larger than 2 kb. said 
DNA sequence comprising a plant ubiquitin 
regulatory system, wherein said regulatory 
system contains a heat shock element and an 
intron. ^ 

2. A DNA sequence as in claim 1 wherein said 
heat shock element Is at least 75% homologous 
to the heat shock consensus sequence. 

3. A DNA sequence as In claim 1 wherein said 

heat shock element Is not more than approxl- 50 
mately 214 nucleotides 5' of the transcription 
start site. 

4. A DNA sequence as in claim 3 wherein said 
regulatory system contains two heat shock 
elements. ^ 

5. A DNA sequence as in claim 4 wherein said 
heat shock elements are situated so as to 
overiap. 

6. A DNA sequence as in claim 1 wherein said 
Intron Is approximately 1 kb in length. « 

7. A ONA sequence as in claim 6 wherein said 
intron is that of the maize ubiqultin promoter. 

8. A DNA sequence as in claim 6 wherein said 
Intron is located not more than 83 nucleotides 3' 

of the transcription start site. 6i 
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